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TO MY PARENTS 


PREFACE TO VOLUME I 
OF THE FIRST EDITION 


The present work is the first volume of a substantially enlarged 
version of the mimeographed notes of a course of lectures first given 
by me in the Indian Statistical Institute, Calcutta, India, during 
1964-65. When it was suggested that these lectures be developed into 
a book, I readily agreed and took the opportunity to extend the scope 
of the material covered. 

‘No background in physics is in principle necessary for understand- 
ing the essential ideas in this work. However, a high degree of 
mathematical maturity is certainly indispensable. It is safe to say 
that I aim at an audience composed of professional mathematicians, 
advanced graduate students, and, hopefully, the rapidly increasing 
group of mathematical physicists who are attracted to fundamental 
mathematical questions. 

Over the years, the mathematics of quantum theory has become 
more abstract and, consequently, simpler. Hilbert spaces have been 
used from the very beginning and, after Weyl and Wigner, group 
representations have come in conclusively. Recent discoveries seem to 
indicate that the role of group representations is destined for further 
expansion, not to speak of the impact of the theory of several complex 
variables and function-space analysis. But all of this pertains to the 
world of interacting subatomic particles; the more modest view of the 
microscopic world presented in this book requires somewhat less. The 
reader with a knowledge of abstract integration, Hilbert space theory, 
and topological groups will find the going easy. 

Part of the work which went into the writing of this book was 
supported by the National Science Foundation Grant No. GP-5224. I 
have profited greatly from conversations with many friends and 
colleagues at various institutions. To all of them, especially to R. 
Arens, R. J. Blattner, R. Ranga Rao, K. R. Parthasarathy, and S. R. S. 
Varadhan, my sincere thanks. I want to record my deep thanks to 
my colleague Don Babbitt who read through the manuscript carefully, 
discovered many mistakes, and was responsible for significant im- 
provement of the manuscript. My apologies are due to all those whose 
work has been ignored or, possibly, incorrectly (and/or insufficiently) 
discussed. Finally, I want to acknowledge that this book might never 
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have made its way into print but for my wife. She typed the entire 
manuscript, encouraged me when my enthusiasm went down, and 
made me understand some of the meaning of our ancient words, 


TAANIT AT WAY AZTAT | .* 


To her my deep gratitude. 


Spring, 1968 V. S. VARADARAJAN 


* Bhagavadgita, 2:47a. 


PREFACE TO THE SECOND EDITION 


aAA uR AT BAG ATA l 


It was about four years ago that Springer-Verlag suggested that a revised 
edition in a single volume of my two-volume work may be worthwhile. 
I agreed enthusiastically but the project was delayed for many reasons, one 
of the most important of which was that I did not have at that time any 
clear idea as to how the revision was to be carried out. Eventually I decided 
to leave intact most of the original material, but make the current edition a 
little more up-to-date by adding, in the form of notes to the individual 
chapters, some recent references and occasional brief discussions of topics 
not treated in the original text. The only substantive change from the earlier 
work is in the treatment of projective geometry; Chapters II through V of 
the original Volume I have been condensed and streamlined into a single 
Chapter II. I wish to express my deep gratitude to Donald Babbitt for 
his generous advice that helped me in organizing this revision, and to 
Springer-Verlag for their patience and understanding that went beyond 
what one has a right to expect from a publisher. 

I suppose an author’s feelings are always mixed when one of his books that 
is comparatively old is brought out once again. The progress of Science in our 
time is so explosive that a discovery is hardly made before it becomes 
obsolete; and yet, precisely because of this, it is essential to keep in sight the 
origins of things that are taken for granted, if only to lend some perspective 
to what we are trying to achieve. All I can say is that there are times when 
one should look back as well as forward, and that the ancient lines, part of 
which are quoted above still capture the spirit of my thoughts. 


Pacific Palisades, V.S. VARADARAJAN 
Dec. 22, 1984 


* Bhagavadgita, 2:47a. 
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INTRODUCTION 


As laid down by Dirac in his great classic [1], the principle of 
superposition of states is the fundamental concept on which the quan- 
tum theory of atomic systems is to be erected. Dirac’s development 
of quantum mechanics on an axiomatic basis is undoubtedly in keep- 
ing with the greatest traditions of the physical sciences. The scope 
and power of this principle can be recognized at once if one recalls 
that it survived virtually unmodified throughout the subsequent transi- 
tion to a relativistic view of the atomic world. It must be pointed out, 
however, that the precise mathematical nature of the superposition 
principle was only implicit in the discussions of Dirac; we are in- 
debted to John von Neumann for explicit formulation. In his charac- 
teristic way, he discovered that the set of experimental statements of 
a quantum mechanical system formed a projective geometry—the 
projective geometry of subspaces of a complex, separable, infinite 
dimensional Hilbert space. With this as a point of departure, he 
carried out a mathematical analysis of the axiomatic foundations of 
quantum mechanics which must certainly rank among his greatest 
achievements {1} [8} [4] [5] [6]. 

Once the geometric point of view is accepted, impressive conse- 
quences follow. The automorphisms of the geometry describe the 
dynamical and kinematical structure of quantum mechanical systems, 
thus leading to the linear character of quantum mechanics. The 
covariance of the physical laws under appropriate space-time groups 
consequently expresses itself in the form of projective unitary repre- 
sentations of these groups. The economy of thought as well as the 
unification of method that this point of view brings forth is truly 
immense; the Schrödinger equation, for example, is obtained from a 
representation of the time-translation group, the Dirac equation from 
a representation of the inhomogeneous Lorentz group. This develop- 
ment is the work of many mathematicians and physicists. However, 
insofar as the mathematical theory is concerned, no contribution is 
more outstanding than that of Eugene P. Wigner. Beginning with 
his famous article on time inversion and throughout his great papers 
on relativistic invariance {1} [3] [4} [5] [6], we find a beautiful and 
coherent approach to the mathematical description of the quantum 
mechanical world which achieves nothing less than the fusion of 
group theory and quantum mechanics, and moreover does this without 


TY 


xvi INTRODUCTION 


compromising in any manner the axiomatic principles formulated by 
Dirac and von Neumann. 

My own interest in the mathematical foundations of quantum me- 
chanics received a great stimulus from the inspiring lectures given by 
Professor George W. Mackey at the University of Washington in 
Seattle during the summer of 1961. The present volumes are in great 
part the result of my interest in a detailed elaboration of the main 
features of the theory sketched by Mackey in those lectures. In sum, 
my indebtedness to Professor Mackey’s lectures and to the books and 
papers of von Neumann and Wigner is immense and carries through 
this entire work. 

There exist today many expositions of the basic principles of 
quantum mechanics. At the most sophisticated mathematical level, 
there are the books of von Neumann [1], Hermann Weyl [1] and 
Mackey {1}. But, insofar as I am aware, there is no account of the 
technical features of the geometry and group theory of quantum me- 
chanical systems that is both reasonably self-contained and comprehen- 
sive enough to be able to include Lorentz invariance. Moreover, recent 
re-examinations of the fundamental ideas by numerous mathematicians 
have produced insights that have substantially added to our under- 
standing of quantum foundations. From among these I want to single 
out for special mention Gleason’s proof that quantum mechanical states 
are represented by the so-called density matrices, Mackey’s extensive 
work on systems of imprimitivity and group representations, and Barg- 
mann’s work on the cohomology of Lie groups, particularly of the 
physically interesting groups and their extensions. All of this has made 
possible a conceptually unified and technically cogent development of 
the theory of quantum mechanical systems from a completely geometric 
point of view. The present work is an attempt to present such an 
approach. 

Our approach may be described by means of a brief outline of the 
contents of the three parts that make up this work. The first part begins by 
introducing the viewpoint of von Neumann according to which every 
physical system has in its background a certain orthocomplemented lattice 
whose elements may be identified with the experimentally verifiable prop- 
ositions about the system. For classical systems this lattice (called the 
logic of the system) is a Boolean o-algebra while for quantum systems it is 
highly nondistributive. This points to the relevance of the theory of 
complemented lattices to the axiomatic foundations of quantum mechanics. 
In the presence of modularity and finiteness of rank, these lattices decom- 
pose into a direct sum of irreducible ones, called geometries. A typical 
example of a geometry is the lattice of subspaces of a finite dimensional 
vector space over a division ring. The theory of these vector geometries is 
taken up in Chapter II. The isomorphisms of such a geometry are induced in 
a natural fashion by semilinear transformations. Orthocomplementations 
are induced by definite semi-bilinear forms which are symmetric with 
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respect to suitable involutive anti-automorphisms of the basic division ring. 
If the division ring is the reals, complexes or quaternions, this leads to the 
Hilbert space structures. In this chapter, we also examine the relation 
between axiomatic geometry and analytic geometry along classical lines 
with suitable modifications in order to handle the infinite dimensional case 
also. The main result of this chapter is the theorem which asserts that an 
abstractly given generalized geometry (i.e., one whose dimension need 
not be finite) of rank >4 is isomorphic to the lattice of all finite 
dimensional subspaces of a vector space over a division ring. The 
division ring is an invariant of the lattice. 

The second part analyzes the structure of the logics of quantum mechani- 
cal systems. In Chapter III, we introduce the notion of an abstract logic 
(= orthocomplemented weakly modular o-lattice) and the observables and 
states associated with it. It is possible that certain observables need not be 
simultaneously observable. It is proved that for a given family of observ- 
ables to be simultaneously measurable, it is necessary and sufficient that the 
observables of the family be classically related, i.e., that there exists a 
Boolean sub o-algebra of the logic in question to which all the mem- 
bers of the given family are associated. Given an observable and a 
state, it is shown how to compute the probability distribution of the 
observable in that state. In Chapter IV, we take up the problem of singling 
out the logic of all subspaces of a Hilbert space by a set of neat axioms. 
Using the results of Chapter II, it is proved that the standard logics are 
precisely the projective ones. The analysis of the notions of an observable 
and a state carried out in Chapter III now leads to the correspondence 
between observables and self-adjoint operators, and between the pure states 
and the rays of the underlying Hilbert space. The automorphisms of the 
standard logics are shown to be induced by the unitary and antiunitary 
operators. With this the von Neumann program of a deductive description 
of the principles of quantum mechanics is completed. The remarkable fact 
that there is a Hilbert space whose self-adjoint operators represent the 
observables and whose rays describe the (pure) states is thus finally 
established to be a consequence of the projective nature of the underlying 
logic. 

The third and final part of the work deals with specialized questions. The 
main problem is that of a covariant description of a quantum mechanical 
system, the covariance being with respect to suitable symmetry groups of 
the system. The theory of such systems leads to sophisticated problems of 
harmonic analysis on locally compact groups. Chapters V, VI, and VII are 
devoted to these purely mathematical questions. The results obtained are 
then applied to yield the basic physical results in Chapters VIII and IX. 
In Chapter VIII, the Schrédinger equation is obtained and the relations 
between the Heisenberg and Schrédinger formulations of quantum 
mechanics are analyzed. The usual expressions for the position, momentum, 
and energy observables of a quantum mechanical particle are shown to be 
inevitable consequences of the basic axioms and the requirement of 
covariance. In addition, a classification of single particle systems is obtained 
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in terms of the spin of the particle. The spin of a particle, which is so charac- 
teristic of quantum mechanics, is a manifestation of the geometry of the 
configuration space of the particle. 

The final chapter discusses the description of free particles from the 
relativistic viewpoint. The results of Chapters V, VI and VII are used to 
obtain a classification of these particles in terms of their mass and spin. 
With each particle it is possible to associate a vector bundle whose square 
integrable sections constitute the Hilbert space of the particle. These 
abstract results lead to the standard transformation formulae for the (one 
particle) states under the elements of the relativity group. By taking 
Fourier transforms, it is possible to associate with each particle a definite 
wave equation. In particular, the Dirac equation of the free electron is 
obtained in this manner. The same methods lead to the localization in space, 
for a given time instant, of the particles of nonzero rest mass. The chapter 
ends with an analysis of Galilean relativity. It is shown that the free 
particles which are governed by Galilei’s principle of relativity are none 
other than the Schrödinger particles of positive mass and arbitrary spin. 

With this the program of obtaining a geometric view of the quan- 
tum mechanical world is completed. It is my belief that no other ap- 
proach leads so clearly and smoothly to the fundamental results. It 
may be hoped that such methods may also lead to a successful de- 
scription of the world of interacting particles and their fields. The 
realization of such hopes seems to be a matter for the future. 


V. S. VARADARAJAN 


CHAPTER I 


BOOLEAN ALGEBRAS ON A CLASSICAL 
PHASE SPACE 


1, THE CLASSICAL PHASE SPACE 


We begin with a brief account of the usual description of a classical 
mechanical system with a finite number of degrees of freedom. Associated 
with such a system there is an integer n, and an open set M of the n- 
dimensional space A” of n-tuples (x1, £2, ***, £p) of real numbers. n is 
called the number of degrees of freedom of the system. The points of M 
represent the possible configurations of the system. A state of the system 
at any instant of time is specified completely by giving a 2n-tuple 
(£i, a*t, La, Pis ° ts Pn) Such that (x,,---, £) represents the configura- 
tion and (p4, +, Pa) the momentum vector, of the system at that instant 
of time. The possible states of the system are thus represented by the 
points of the open set M x R” of R?”. The law of evolution of the system is 
specified by a smooth function H on M x R", called the Hamiltonian of 
the system. If (x(t), -< -, 2,(t), p(t), -© +, Pa(t)) represents the state of the 
system at time t, then the functions 2,(-), p,(-), 1=1,2,.::, n, satisfy 
the well known differential equations: 


dx, oH 


it = Op, = 1, 2, » nN, 
” d oH 

oP, = . 

dt — Om, v= 1,2, 


For most of the systems which arise in practice these equations have 
unique solutions for all ¢ in the sense that given any real number tọ, 
and a point (x,°, %°,--+,%,°, P10, ***, Pa?) of Mx R”, there exists a 
unique differentiable map t —> (x,(t),---, a(t), pil), +--+, Pa(t)) of R* into 
M x R” such that z,(-) and p,(-) satisfy the equations (1) with the initial 
conditions 


(2) %4(to) = x,°, pilto) = pi; = l, 2, NA, 


If we denote by s an arbitrary point of M x R”, it then follows in the 
standard fashion that for any ¢ there exists a mapping D(t)(s —> D(t)s) 
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of M x R” into itself with the property that if s is the state of the system 
at time tọ, D(t)s is the state of the system at time t+t,. The transfor- 
mations D(t) are one-one, map M x R” onto itself and satisfy the equations: 


D(0) = I (the identity mapping), 
(3) D(—t) = Dit)~*, 
D(t, +t2) = D(t,)D(t2). 


If, in addition, H is an indefinitely differentiable function, then the D(t) 
are also indefinitely differentiable and the correspondence t—> D(t) 
defines a one-parameter differentiable transformation group of M x R” so 
that the map t,s—> D(t)s of R'x Mx R” into M x R” is indefinitely 
differentiable. The set M x R” of all the possible states of the system is 
called the phase space of the system. 

In the formulation described above, the physical quantities or the 
observables of the system are described by real valued functions on M x R”. 
For example, if the system is that of a single particle of mass m which 
moves under some potential field, then n =3, M = R’, and the Hamiltonian 
H is given by 


l 
(4) FI (x1 ,%2,%3)P1,Po,P3) = om (P? +P? +97) + V(x,,%0,%). 


The function s —> (P? +p,? +p3°)/2m is the kinetic energy of the particle 
and the function s —> V(2,,%2,x73) is the potential energy of the particle. 
The function s —> p, (i=1, 2, 3) represents the 2z,-component of the 
momentum of the particle. In the general case, if f is a function on M x R" 
which describes an observable, then f(s) gives the value of that observable 
when the system is in the state s. 

This formulation of the basic ideas relating to the mechanics of a 
classical system can be generalized significantly (Mackey [1], Sternberg 
[1]). Briefly, this generalization consists in replacing the assumption that 
M is an open subset of R” by the more general one that M is an abstract 
C” manifold of dimension n. The set of all possible configurations of the 
system is now M, and for any x e M, the momenta of the system at this 
configuration are the elements of the vector space M,*, which is the dual 
of the tangent vector space M, of M at x. The phase space of the system 
is then the set of all possible pairs (z,p), where x e M and pe M,*. This 
set, say S, comes equipped with a natural differentiable structure under 
which it is a C” manifold of dimension 2n, the so-called cotangent bundle 
of M. The manifold S admits further a canonical 2-form which is every- 
where nonsingular and this gives rise to a natural isomorphism J of the 
module of all C” vector fields on S onto the module of all 1-forms (both 
being considered as modules over the ring of C° functions on S). The 
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dynamical development of the system is then specified by a C” function 
H on 8S, the Hamiltonian of the system. If t —> s(t) is a curve representing 
a possible evolution of the system, then we have the differential equations: 


6) BO [J-4dH)\s(0). 


Here ds(t)/dt is the tangent vector to S at the point s(t) along the curve 
t —> s(t) and J~+(dH) is the vector field on S corresponding to the 1-form 
dH; the right side of the equation (5) being the value of this vector field 
at the point s(t) of S. In the special case when M is an open set in R” and 
Lis Ta ***, Za are the global affine coordinates on M, S is canonically 
identified with M x R” and, under this identification, J goes over into the 
map which transforms the vector field 


2, Aila/an)+ > Belay) 
=] i=l 


into the 1-form 
n n 
— > B, dx, + > A, ap,. 
i=l i=] 


The equation (5) then goes over to (1) (cf. Chevalley [1], Helgason [1] for 
a discussion of the general theory of differentiable manifolds). 

In this general setup, the dynamical development of the system is given 
by the integral curves of the vector field J~*(dH). It is necessary to 
assume that the integral curves are defined for all values of the time 
parameter ¢. One can then use the standard theory of vector fields to 
deduce the existence of a diffeomorphism D(t) of S for each t such that 
the correspondence ¢-—> D(t) satisfies the conditions (3), and the map 
t, s —> D(t)s of R! xS into S is C”. If the system is at the state s at time 
to, then its state at time ¢+t, is D(t)s. The physical observables of the 
system are then represented by real valued functions on S. A special class 
of Hamiltonian functions, analogous to (4), may be defined in this general 
framework. Let x > <.,.>, be a C® Riemannian metric on M, ¢., .>z 
being a positive definite inner product on M,x M. For each x e M, we 
then have a natural isomorphism p — p* of M,* onto M, such that 
p(u)=<u,p*>, for all pe M,* and for all we M,. The analogue of (4) is 
then the Hamiltonian H given by 


(6) H(x,p) = Cp* ps + V(x), 


where V is a C” function on M. The function xv, p —> ¢(p*,p*>, then 
represents the kinetic energy of the system in question. 

It may be pointed out that one can introduce the concept of the 
momenta of the system in this setup. Let 


(7) y: t—> y 
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be a one-parameter group of symmetries of the configuration space M, 
i.e., y(t > yı) is a one-parameter group of C” diffeomorphisms of M onto 
itself such that the map t, x —> y(x) of R! x M into M is C”. The infinitesi- 
mal generator of y is a O” vector field, say X,, on M; for any xe M and 
any real valued C°” function f defined around zx, 


(Xe) = {gronh 


X, defines, in a natural fashion, a C°” function pu, on S. In fact, if x e M 
and pe M,*, 
u(x, p) = p(X ,(2)) 


(here X,(x) denotes the tangent vector to M at x which is the value of 
X, at x). The observable corresponding to the function p, is called the 
momentum of the system corresponding to the one-parameter group of 
symmetries y. If M = R”, if z4, * +, £, are the global affine coordinates on 
M, and if 

Ye (Lis ts En) = Ly — tC, tty Ea — tens 


then the observable corresponding to u,e is called the component of the 
linear momentum along (c,,--+,c,). In the same case, if 


yit, “fy Xp) = (Yis sey Yn) 
where 


Y, = 4, r# 1,5, 


Yı = x, cost+2,sint, y; = —%,sint+2, cost, 


then the observable corresponding to u,!.1 is called the angular momentum 
with respect to a rotation in the i-j plane. A straightforward calculation 
shows that in the case when M = Rk", S = R” x R”, and x,-+-+, Ens Dy,°°°) Da 
are global coordinates on S ((%1,+++, ns Pis ***, Pn) depicts ST. p,(dx;,)~), 


Hye(X, p) = CPi t'e +CnDy, 
and 


py l£ p) = Xp, — tpi 


Suppose now that M is a general C® manifold and S its cotangent bundle. 
If f and g are two C® functions on S, then we can form J~ (df), which is a 
C@ vector field on S, and apply it to g to get another C® function on 8, 
denoted by [f.g]: 


(8) [fig] = (J= (df). 
[f.g] is called the Poisson Bracket of f with g. If we use local coordinates 
%,,:°+, 2, on M and the induced coordinates 2,,-+-,2%,, Pis *'*, Pn On S 


(so that (%,,--+,%n,Pi.°**, Pa) represents >, p,(dx,)), then J goes over 
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into the map which (locally) sends 5, A,(0/0%,)+ >, B(ə/əp;) into 
—>, Bı dx; +2; 4, dp, and [f,g] becomes 


əf og Of ag 
(9) [fg] = 2 (= Bu, dm, 5). 


The map f, g —> [f,g] is bilinear, skew symmetric, and satisfies the identity 


(f.9],4) +[[9.4],f]+[[4,fl.9] = 9, 
as is easily verified from (9). If X is any C°” vector field on M and pyx is 
the C” function on S defined by 
pxl% p) = p(X(2)), 
then one can verify, using (9), that 


Max+oy = Alx tbpy (a, b constants), 


Mix,y) = [ux,y], 


where [X,Y] is the Lie bracket of the vector fields X and Y. If f is any 
C°” function on M and f° is the lifted function on £, i.e., 


f(e,p) = f(z); 


then we may use (9) once again to check that 
[exf] = (Xf)? 


for any C® vector field X on M. 

In many problems, there is a Lie group G which acts on M and provides 
the natural symmetries of the problem. For g e G we write x —>g-zx for 
the symmetry associated with g and assume that g, x — g-x is C® from 
Gx M into M. In such problems, one restricts oneself to the momenta 
specified by the one-parameter groups of M. If g is the Lie algebra of G 
(cf. Chevalley [1]) and if we associate for X e g, the vector field on M 
denoted by X also and defined by 


(Za) = (FflexptX-2)) 
then we obtain the relations 
Hixxy) = [exer], 
[mx.f°] = (Xf)° 


between the configuration observables f° and the momentum observables 
ux. These relations are usually referred to as commutation rules. 


(10) 


6 GEOMETRY OF QUANTUM THEORY 


2. THE LOGIC OF A CLASSICAL SYSTEM 


We shall now examine the algebraic aspects of a general classical system. 
In view of the discussion carried out just now, it is clear that for any 
classical system © there is associated a space S called the phase space of 
©. The states of the system are in one-one correspondence with the points 
of S. The notion of a state is so formulated that if one knows the state 
of the system at an instant of time tọ, and also the dynamical law of 
evolution of the system, then one can determine the state of the system 
at time t +t. The observables or physical quantities which are of interest 
to the observer are then represented by real valued functions on S. If f is 
the function corresponding to a particular observable, its value f(s) at the 
point s of S is interpreted as the value of the physical quantity when the 
system is in the state s. If s is the state of the system at time tọ, we can 
write D(t)s for the state of the system at time t+¢,. We thus have a trans- 
formation D(t) of S into itself. For each t, D(t) is invertible and maps S 
onto itself. The correspondence t—> D(t) satisfies the equations (3). 
t — D(t) is then a one-parameter group of transformations of S. It is called 
the dynamical group of the system ©. 

These concepts make sense in every classical system. In the case of any 
such system the most general statement which can be made about it is 
one which asserts that the value of a certain observable lies in a real 
number set H. If the observable is represented by the function f on S, 
then such a statement is equivalent to the statement that the state of the 
system lies in the set f~*(£) of the space S. In other words, the physically 
meaningful statements that can be made about the system are in corre- 
spondence with certain subsets of S. The inclusion relations for subsets 
naturally correspond to implications of statements. In mathematical 
terms, this means that at the background of the classical system there is a 
Boolean algebra of subsets of the space S, the elements of which represent 
the statements about the physical system. It is natural to call this Boolean 
algebra the logic of the system. 

Suppose now that © is a system which does not follow the laws of 
classical mechanics. Then one cannot associate with it a phase space in 
general. It is nevertheless meaningful to consider the totality of experi- 
mentally verifiable statements which may be made about the system. 
This collection, which may be called the logic of GS, comes equipped with 
the relations of implication and negation which convert it into a com- 
plemented partially ordered set. For a classical system this partially 
ordered set is a Boolean algebra. Clearly, it is possible to conceive of 
mechanical systems whose logics are not Boolean algebras. We take the 
point of view that quantum mechanical systems are those whose logics form 
some sort of projective geometries and which are consequently nondistributive 
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lattices. With such a point of view it is possible to understand the role 
played by simultaneously observable quantities, the uncertainty relations, 
and the complementarity principles. These phenomena, which are so 
peculiar to quantum systems, will then be seen to be consequences of the 
nondistributive nature of the logic in the background of the system G. 

It might seem a bit surprising that the basic assumption on a quantum 
system is that its logic is not a distributive lattice. It would be natural to 
argue that statements about a physical system should obey the same 
rules as the rules of ordinary set theory. The well known critiques of von 
Neumann and Heisenberg address this question (von Neumann [1], 
Birkhoff-von Neumann [1], Heisenberg [1]). The point is that only 
experimentally verifiable statements are to be regarded as members of the 
logic of the system. Consequently, as it happens in many questions in 
atomic physics, it may be impossible to verify experimentally statements 
which involve the values of two physical quantities of the system—for 
example, measurements of the position and momentum of an electron. 
One can verify statements about one of them but not, in general, those 
which involve both of them. What the basic assumptions imply is that 
the statements regarding position or momentum form two Boolean sub- 
algebras of the logic but that there is in general no Boolean algebra which 
contains both of these Boolean subalgebras. 

Before beginning an analysis of the logic of general quantum mechanical 
systems it would be helpful to recast at least some of the features of the 
formulation given in section 1 in terms of the logic of the classical system. 
In the first place it is natural to strengthen the hypothesis and assume 
that the logic of a given classical system © is a Boolean o-algebra, say Z, 
of subsets of S, the phase space of S. Suppose now, that an observable 
associated with the system is represented by the real valued function f on 
S. The statements concerning the observable are then those which assert 
that its value lies in an arbitrary Borel set E of the real line and these are 
represented by the subsets f~+(#) of S. The observable can thus be repre- 
sented, without any loss of physical content, equally by the map 
E —> f ~1(£) of the class of Borel subsets of the real line into Z. The range 
of this mapping is a sub-o-algebra, say &,. Suppose g is a real valued Borel 
function on the real line. Then, the observable represented by the function 
g o f (s > g(f(s))) can also be represented by the map E > f~1(g~1(#)) 
from which we conclude that #,,, is contained in £. 

In order to formulate the general features of a classical mechanical 
system in terms of its logic &, it is therefore necessary to determine to 
what extent an abstract o-algebra Z can be regarded as a o-algebra of 
subsets of some space S; further to determine the class of mappings from 
the o-algebra of Borel sets of the real line into Y which correspond to 
real valued functions on S; and to clarify the concept of functional 
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dependence in this general context. We shall now proceed to a discussion 
of these questions. 


3. BOOLEAN ALGEBRAS 


Let Z be a nonempty set. Z is said to be partially ordered if there is a 
relation < between some pairs of elements of Y such that (i) a<a for all 
ain £&; (ii)a<b and b<aimply a=b; (iii) a<b and b<cimplya<c. If Z 
is partially ordered, there is at most one element called the null or zero 
element and denoted by 0, such that 0<a for all a in Z. Similarly there 
is at most one element called the unit element and denoted by 1, such that 
a<1 for alla in Z. More generally, for any nonempty subset F of # there 
exists at most one element c of Z such that (i) a<c for alla e F; (ii) if d 
is any element of # such that a<d for all ae F, then c<d. We shall 
write Vaer 4 for c whenever it exists. If F is a finite set, say F={a,,---, an}, 
it is customary to write \/?., a, or a, Va_.V--- Va, instead of Vaer 4- 
In an analogous fashion, for any subset F of Z there exists at most one 
element c such that (i) c<a for alla e F; (ii) if d is any element of # such 
that d<a for allae F, then d<c; we denote it by Naer a whenever it 
exists. If F is a finite set, say /’={a,,---,a,}, we often write A%.; a, or 
A, Aa A +> AG, instead of Ager a. The partially ordered set Z is called 
a lattice if 


(11) (i) Oand l exist in F and 01, 
(ii) V, a and A a exist for all finite subsets F of Z. 

Suppose that Y is a lattice. Given any element a of Z, an element a’ of 
Ẹ is said to be a complement of a if aAa’=0 and ava'=1. a is then a 
complement of a’. Z is said to be complemented if, given any element, 
there exists at least one complement of it. It is obvious that 0 and 1 have 
the unique complements 1 and 0, respectively. A lattice Y is said to be 
distributive if for any three elements a, b, c of Y, the identities 


aN(bve)=(@ANb6)V(@Ao), 
av (bAc)=(av 6b) A (ave) 


are satisfied. A complemented distributive lattice is called a Boolean 
algebra. A Boolean o-algebra £ is a Boolean algebra in which A aer @ and 
Vaer @ exist for every countable subset F of Z. 

Every element in a Boolean algebra has a unique complement. Suppose 
in fact that Y is a Boolean algebra and that a is an element with two 
complements a, and a,. Then, one has 


a, = a, A (a V Gg) = (A; Aa) V (@ A Ag) = A, A Ag < Q3; 
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similarly, a,<a,, so that a, =a. The unique complement of a is denoted 
by a’. Using the standard manipulations of set theory it is easy to show 
that (Naer 4) = Vaer @ and (Vaer 4Y = Naer a for any finite subset F 
of Z. If & is a Boolean o-algebra, then the same identities remain valid 
even when F is countably infinite. If Y is any Boolean algebra and a, b 
are elements in it with a <b, c=a’ A 0 is the unique element of Y such that 
aANc=0 and aV c=b; c is called the complement of a in b. Since c=b Aa’, 
c<a’ (cf. Birkhoff [1] and Sikorski [1] for the general theory of Boolean 
algebras and o-algebras). 

A homomorphism of a Boolean algebra £, into a Boolean algebra £, is 
a map h of Z into L such that (i) h(0)=0, A(1)=1; (ii) h(a’) =h(a)’ for 
alla in Z; (iii) h(a v b)=A(a) v h(b), h(a A b)=h(a) A h(b) for all a, b in L. 
If h is a homomorphism and a <b, then h(a) <h(b). An isomorphism of L 
onto £, isa homomorphism h of L, onto Y, such that h(a) =0 if and only 
if a=0; in this case h is also one-one. 

The class of all subsets of any set is a Boolean algebra under set 
inclusion and set complementation. However, obviously this is not the 
most general Boolean algebra since infinite unions and intersections exist 
in it. Suppose now that X is a topological space. The class of subsets of X 
which are both open and closed (open-closed) is obviously a Boolean 
algebra. A well known theorem of Stone [1] asserts that every Boolean 
algebra is isomorphic to one such and that, if we require the topological 
space to be compact Hausdorff as well as totally disconnected, it is 
essentially uniquely determined by the Boolean algebra. We recall that a 
compact space is said to be totally disconnected if every open subset of it 
can be written as a union of open-closed subsets. We shall call a compact 
Hausdorff totally disconnected space a Stone space. 

Let Y be a Boolean algebra. A subset Æ of Z is called a dual ideal if 
the following properties are satisfied: 

(i) OEM, 
(ii) ifaeMW and a<b, then be Me, 

(iii) ifa,be M, then aa be. 


M is said to be maximal if it is properly contained in no other dual ideal. 

The naturalness of the notion of maximal dual ideals can be seen in the 
following way. Let X be a Stone space and Z = #(X) the Boolean algebra 
of all open-closed subsets of X. Then, for any x e X, the collection A(z), 


where 
M(x) ={A: AE Lf, xe A}, 


is easily seen to be a maximal dual ideal; it is also easy to check that the 
correspondence x —> M(x) is one-one if we notice that X is Hausdorff. 
The concept of maximal dual ideals is central in the proof of Stone’s 
theorem. 
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Suppose that X is an arbitrary Boolean algebra. Using Zorn’s lemma 
one can show easily that maximal dual ideals of Z exist. Let X =X(f) 
be the set of all maximal dual ideals of #. For any a € # we define X, by 


Xa ={M: MEX, ace MM, 


X,=@, the null set, and X, =X. We shall say that a subset ACX is 
open if A is the union of sets of the form X,. This definition defines the 
structure of a topology on X called the Stone topology. We now have: 


Theorem 1.1 (Stone [1]). Let Z be a Boolean algebra and let X=X(L) 
be the space of all maximal dual ideals of L. Then, equipped with the Stone 
topology, X becomes a Stone space. The map a —> X, 18 then an isomorphism 
of L with the Boolean algebra of all open-closed subsets of X. X 1s determined 
by L, among the class of Stone spaces, up to a homeomorphism. More 
generally, let X and Y be Stone spaces and let L(X) and L(Y) be their 
respective Boolean algebras of open-closed subsets. If u is any isomorphism 
of L(Y) onto L(X), there exists a homeomorphism h of X onto Y such that 


(12) u(A) = h4) (Ae ¥(Y)); 
moreover, h is uniquely determined by (12). 


This theorem is very well known and we do not give its proof. The 
reader may consult the books of Birkhoff [1], Sikorski [1], and the paper 
of Stone [1] for the proof. 


Corollary 1.2. Let X be a Stone space and let L =L (X) be the Boolean 
algebra of open-closed subsets of X. If t — D(—œ<t<œ) is any one- 
parameter group of automorphisms of L , there exists a unique one-parameter 
group t — h, of homeomorphisms of X onto itself such that for all t and 
AeL, D(4)=ħ*(4). 


Proof. Theorem 1.1 ensures the existence and uniqueness of each h,. If 
tı, ta are real, then h, +: and hi, °, induce the same automorphism 
D, +t of L, SO that hi, +to =h, o hı. 


The theorem of Stone shows that there is essentially no distinction 
between an abstract Boolean algebra and a Boolean algebra of sets. If 
one deals with Boolean o-algebras, the situation becomes somewhat less 
straightforward. We shall now describe the modifications necessary when 
one replaces Boolean algebras by Boolean a-algebras. 

If Z, and Z, are Boolean a-algebras, and h a map of Y, into Zz, h is 
called a oa-homomorphism if (i) h(0)=0, A(1)=1; (ii) h(a’)=A(a)’ for all 
ae Y,; and (iii) if F is any subset of Z, which is finite or countably 
infinite, h(\Vaer @)= Vaer h(a) and h( Naer 4)= Naer h(a). Suppose Y, 
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and £, are two Boolean o-algebras and h a o-homomorphism of £, onto 
F. The set VW ={a : a e Z, h(a) =0} is a subset of Z, with the properties: 
(i)0EN,1L EN; (ii) ifa €e WY and b <a, then b € M; (iii) if F is a countable 
subset of M, \Vaer a E M. VW is called the kernel of h. Suppose conversely 
£ is a Boolean o-algebra and VY a subset of Z with properties (i) to (iii) 
listed above. We shall say that elements a and b of & are equivalent, 
a~b, ifaanb’ and b^a' are in YW. It is easily verified that ~ is an equiv- 


alence relation. Let Y be the set of all equivalence classes, and for any a 
in £, let a denote the unique equivalence class containing a. We define 
a<b whenever there are elements a in G, and b in b such that a<b. It is 


then easily shown that Y is a Boolean o-algebra whose zero and unit 
elements are, respectively, 0 and 1, and that the map a—>d@ is a ø- 


homomorphism of Z onto Z with kernel M. We write f= LIN. 


Theorem 1.3 (Loomis [1]). Let Z be a Boolean a-algebra. Then there 
exists a set X, a o-algebra S of subsets of X, and a a-homomorphism h of 
F onto L. 


Proof. Let X be a Stone space such that the lattice Y’ = £(X) of open- 
closed subsets of X is isomorphic to the Boolean algebra Z. Let S 
denote the smallest o-algebra of subsets of X containing #’. We denote 
by U and A the operations of set union and set intersection for subsets of 
X, and by v and A the lattice-theoretic operations in Z and Z’. 


If A,, 4z, ++- is any sequence of sets in Z’, then V, A,=A exists in 
£’ since YL’ is isomorphic to # and Ñ is a o-algebra. Since A is the smallest 
element of Z’ containing all the A,, it follows that the set A—\), A, 
cannot contain any element of Z’ as a subset. The sets in Z’ form a 
base for the topology of X and hence we conclude that A—\_), A, cannot 
contain any nonnull open set. Since J, A, is open, this shows that 
A—\),, A, is a closed nondense set. 

Consider now the class S, of all sets A e S with the property that for 
some B in Z’, (A—B) U (B— 4A) is of the first category. If B, and B, 
are elements of Y’ such that (A —B,) U (B,—A) is of the first category 
(t=1,2), then it will follow that (B,—B,)U(B,—B,) is of the first 
category, which is not possible (by the category theorem of Baire) unless 
B,=B,. Thus, for any A in S, there exists a unique B=h,(A) in Z’ 
such that (A—B) U (B—A) is of the first category. Clearly Z'S S 
and for A e #’, h,(A)=A. 

We claim that S is a o-algebra. Since 


(A—B) U (B—A) = (A’—B’) u (B’—A’) 


(primes denoting complementation in X), we see that for any A in S, 
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A’ is in SY, and h,(A’)=h,(A)’. Suppose A,, Ag,--- is any sequence in 
S,. Write B,=h,(A,), A=U, 4A, B=Vn Ba Bo=U, Ba. By what we 
said above, B—B, is a closed nondense set. Moreover, as B, < B, we have 


(4- B) U (B—4) £ {(A—B,) Y (Bo—A)} U (B— Bo) 
S Y {(A,— Ban) U (B,—A,)} U (B— B»). 


As all members of the right side are of the first category, this proves that 
Ae S, and h,(A)=\/,4,(A,). In a similar fashion we can show that 
(\n 4n lies in Sı and h(n An)= An (An) 

The conclusions of the preceding paragraph show that Y,, is a Boolean 
o-algebra S S. Since S, contains L’, S= S. Moreover, we see at the 
same time that h, is a o-homomorphism of S onto L’. If we write h =k o h, 
where k is an isomorphism of #’ onto Y, then h is a o-homomorphism 
of F onto Z. 


Remark. Let Z be the o-algebra of Borel sets on the unit interval [0,1], 
MN the class of Borel sets of Lebesgue measure 0, and Y= S/W. Then L 
is a Boolean o-algebra. We can obviously define Lebesgue measure A as a 
countably additive function å on Z; À is strictly positive in the sense that 
for any a#0 of Z, A(a) is positive. From this it follows that any family 
of mutually disjoint elements of # is countable. On the other hand, since 
F is countably generated, so is Z. However, any o-algebra of subsets of 
some space X which is countably generated can be proved to have atoms, 
that is, minimal elements. Since & does not have atoms, £ cannot be 
isomorphic to any o-algebra of sets. 


4. FUNCTIONS 


We now take up the second question raised in section 2, namely, the 
problem of describing the calculus of functions on a set X entirely in 
terms of the Boolean o-algebra of subsets of X with respect to which all 
these functions are measurable. The results are summarized in theorems 
1.4 and 1.6 of this section. 

Let X be any set of points x and S a Boolean o-algebra of subsets of X. 
A function f from X into a complete separable metric space Y is said to be 
F -measurable if f-}(H) e F for all Borel sets HG Y. If f is A-measurable, 
the mapping E — f~1(£) is a c-homomorphism of the o-algebra Z(Y) of 
Borel subsets of Y into S. Suppose now Ẹ is an abstract Boolean o- 
algebra. We shall then define a Y-valued observable associated with £ to be 
any o-homomorphism of Z(Y) into Z. If Y= Rt}, the real line, we call 
these observables real valued and refer to them simply as observables. 
From our definition of c-homomorphisms we see that a map u(E —> u(E)) 
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of Z(Y) into ¥ is a Y-valued observable associated with # if and only if 
(i) u(@)=0, u(Y)=1; (ii) u( Y—H)=u(Z) for all E in AY); (iii) if 
E, E>,- -- is any sequence of Borel sets in Y, u(n, #,)=\V, U(E) and 
u(n E.) = An u(En). 


Theorem 1.4 Let X be a set, F a Boolean c-algebra of subsets of X and 
h a a-homomorphism of S onto a Boolean c-algebra L. Suppose further 
that u(E — u(E)) is any (real valued) observable associated with L. Then 
there exists an S -measurable real valued function f defined on X such that 


(13) u(E) = h(f-*(4)) 


for all Borel sets E S Rt. f is essentially unique in the sense that if g is any 
S -measurable real valued function defined on X such that u(E)=h(g7+(E)) 
for all Borel sets E S Rt, the set {x : x e X, f(x) #g(x)} belongs to the kernel 
of h. l 


Proof. We begin with a simple observation. Suppose A and B are two 
subsets of X in S such that ACB, and c any element of Z such that 
h(A)<c<h(B). Then we can select a set C in S such that A ECS B and 
h(C)=c. In fact, since h maps Z onto Y, there exists C, in Z such that 
h(C,)=c. If we define C=(C, AO B) U A, then ACCCB while 


h(C) = (A(C,) A A(B)) v A(A) = (CA 6) Va=e. 


We now come to the proof of theorem 1.4. Let 7,, 72,--+ be any distinct 
enumeration of the rational numbers in R? and let D, be the interval 
{t :te Ri, t<r}. Evidently, u(D,)<u(D,) whenever 7,;<7,. We shall now 
construct sets A,, Ao,--- in S such that (a) A(A,)=u(D,) for all i=1, 2, 3, 
--+3 (b) A;S.A, whenever r,<7,. Let A, be any set in S such that 
h(A,)=u(D,). Suppose A,;, Ag, Ag,---, A, in S have been constructed 
such that (i) h(A,;)=u(D,) for 1=1, 2,---, n; (ii) A; SA, whenever 7,<7,, 
1<1t,7<n. We shall construct A,,, as follows. Let (i, 2,---,2,) be the 
permutation of (1, 2,---,m) such that 7,,<7,,<---<7,,. Then, there 
exists a unique k such that 7,,<1i41<1%,,, (we define 7,,=—oo and 
Ting, = +), and by the observation made in the preceding paragraph, 
we can select A,,, in S such that A,,CA,,,54;,,, (we define A, =Ø, 
Anı =X). The collection {4,, Ag,--+,An4i} then has the same pro- 
perties relative to 11,7%9,°-+,%n+1 aS {A,,:--,A,} had relative to 
11,12.'**, Ta It thus follows by induction that there exists a sequence 
A,, Ag::- of sets in S with the properties (a) and (b). As 


QA) = md =A D) =o 


we may, by replacing A, by A,—(),A, if necessary, assume that 
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where N =X —\_), A;. We now define a function f as follows: 


0 if zeN, 
(14) Fæ) = tint, :ae A} if wel A, 
J 
Clearly, f is finite everywhere. Moreover, for any k, 


JD) A (X-N)= U Ap 


Fitz <Tk 
so that f is “-measurable. Further, 
MfD) = hi UJ A;) 


j:T] <Tk 


so that h(f-1(E))=u(E) whenever E= D, for some k. Since the class of 
all Æ for which this equation is valid is a Boolean o-algebra, we conclude 
that A(f-1(E))=u(E) for all Borel sets Æ. 

It remains to examine the uniqueness. Let g be any real valued J- 
measurable function on X such that h(g~+(#))=u(Z£) for all Borel sets Æ. 
Then, if we write D,’ for R'— D,, 


M = {x : xex, fx) + g(x)} 
= U De) Ag De) U (FHD) A gD} 


so that 
h(M) = V {u(Dx) A u(D,’)} 


= 0. 


This shows that M belongs to the kernel of k. This completes the proof 
of the theorem. 


Lemma 1.5. Let X be a set, F a o-algebra of subsets of X and fan S- 
measurable mapping of X into R". Suppose S~ ={f- F) : Fe B(R*)}. 
Then to any S~-measurable real function c on X there corresponds a real 
valued Borel function c~ on R” such that c(x)=c~(f(x)) for al xe X. 


Proof. Since c is /~-measurable, there exists a sequence c, (n=1, 2,---) 
of /~-measurable functions such that (i) each c, takes only finitely many 
values; (ii) c,(2) —> c(x) for all x e X. For any n, let Ani, Ano, ++, Any, be 
disjoint subsets of X whose union is X such that c, is a constant, say a,,, 
on A,,, the a,, being distinct for1=1, 2,---, k,. Since c, is “~-measurable, 
An E S~; so there exists a Borel set B,, of R” such that A,,=f~+(B,,) 
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(t=1,2,---,k,). Replacing B,, by Bu—l\lU;<i Bn; if necessary, we may 
assume that the B,, are disjoint. Let us define the function c,~ on R” as 


follows: 
a, if te By, 
OM = 19 if teU Bu 
t 


Clearly c,~ is Borel and c,(x)=c,~(f(x)) for all x in X. Let us define c~ 
on R” as follows: 
fe- c,,~ (t) if the limit exists, 
ce~(ét) = 4° 


0 otherwise. 


Clearly c~ is Borel. Since c,,(x)=c,,~(f(x)) and lim c,(x) exists and is equal 
to c(x) for all x in X, c~(f(x))=c(x) for all x in X. 

Let Z be a Boolean o-algebra. L SẸ is said to be a sub-c-algebra if 
(i) 0, 1 e Z; (ii) if a e L, then a’ e L; (iii) if a}, ag,--- are in Z, then 
Van an and Nn a, are in Z. A sub-c-algebra Z is said to be separable if 
there exists a countable subset D of Z such that Z is the smallest sub-o- 
algebra of Y containing D. 


Theorem 1.6. (i) Let Z be a Boolean a-algebra and u(E —> u(E)) an 
observable associated with L. Then the range L, ={u(E) : E e B(R!)} of u 
1s a separable Boolean sub-o-algebra of L. Conversely, if L, is a separable 
Boolean sub-c-algebra of L, there exists an observable u associated with L 
such that L, is the range of u. 

(ii) Let u, (i1=1,2,..-,n) be observables associated with L, and £, 
(1=1,2,..., n) their respective ranges. Suppose Ly is the smallest sub-o- 
algebra of L containing all the f,. Then there exists a unique o-homomorphism 
u of Bh") (the o-algebra of Borel subsets of the n-dimensional space R") 
onto La such that for any Borel set E of R*, u,(#)=u(p,~1(E)), where p, is 
the projection (t,, ta,- - -, tn) >t, of R” onto R!. If p is any real valued Borel 
function on R", the map E —> u(p E)E e Z(R!)) is an observable 
associated with L whose range is contained in Ly. Conversely, if v(E —> v(E)) 
is any observable associated with L such that the range of v is contained in 
FL o, there exists a real valued Borel function p on R” such that v(#) =u(p-1(£)) 
for all E. 


Proof. If u is an observable with range Z, &, is obviously the smallest 
sub-o-algebra of Z containing all the u(E), where E is any open interval 
of Rt with rational end points. This shows that Z, is separable. 

Suppose conversely that Z SF is a separable sub-c-algebra of Z. 
By theorem 1.3 there exists a set X, a o-algebra SZ of subsets of X, and a 
o-homomorphism h of S onto L. Let {A, :n=1, 2,---} be a countable 
family of sets of S such that Z, is the smallest sub o-algebra of Z 
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containing all the h(A,). We denote by S, the smallest c-algebra of subsets 
of X containing all the A,. The function 


C22 —> (X4, (2), X4a (T) ++ Xay(@)s +++) 


(where y, denotes the function which is 1 on A, and 0 on X — 4) is Sy- 
measurable from X into the compact metric space Y which is the product 
of countably many copies of the 2-point space consisting of 0 and 1. 
Moreover, it is obvious that each A, is of the form c~}(F) for some Borel 
set FCY, and hence Sa ={071(F) : F Borel in Y}. Now, by a classical 
theorem (Kuratowski [1]), there exists a Borel isomorphism d of Y onto 
R?, so that the function c, : x —> d(c(x)) is an S -measurable real valued 
function and Y,)={c,~1(H): E Borel in R*}. If we now define, for any 
Borel set E of Rt, u(E) by the equation 
u(E) = h(c,~*(#)), 

then v is an observable associated with # whose range is #,. This proves (i). 
~ We now come to the proof of (ii). Suppose u,, Uz, -++, U, are observables 
associated with Z, having ranges ¥,,---, &,, respectively. Each Z, is 
separable and hence Zp, the smallest sub-c-algebra of Z containing all 
the £,, is also separable. Let X, S, and h have the same significance 
as in the proof of (i). By theorem 1.4, there exists a real valued S- 
measurable function f; on X such that u,(#)=A(f,-+(£)) for all Borel 
subsets E of R!. Let f be the map x—>(f,(x),---,f,(z)) of X into R°. 
Then f is S-measurable. The map u : F —h(f-1{F)) is then a o- 
homomorphism of 4(R#") into Z such that u,(£#)=u(p,-*(£)) for all 
He&@(R*). Since Z(R”) is the smallest c-algebra of subsets of R” con- 
taining all the sets p,~1(£), it is clear that the range of u is Yo. The 
uniqueness of u is obvious. 

For any real Borel function » on R”, E —> u(p~1(£)) is an observable 
associated with Y whose range is obviously contained in £.. Suppose 
now that v is an observable associated with Y whose range Y,CP p. 
If we use the notations of the previous paragraph, and define S~ by 

S~ = (fF): Fe BR}, 
then h maps S~ onto Ly. Applying theorem 1.4 to S~ and v, we infer 
the existence of a real valued S/~-measurable Borel function c on X 
such that h(c~+(#))=v(Z#) for all Borel sets E of Rt. By lemma 1.5, since 
c is “#~-measurable, there exists a real valued Borel function p on R” 
such that c(x)=9(f(x)) for all x e X. If now Æ is any Borel set on the line, 


we have: 
u(p~*(Z)) = h(f-*(p-(£))) 
= h(c-*(#)) 
= v(£). 
This proves (ii) and completes the proof of the theorem. 
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Remark. The uniqueness of u, guaranteed by (ii) of theorem 1.6 shows 
that u is independent of the constructs X, Z, and h, used in its construc- 
tion. Consequently, for any real Borel function p on R”, the map 
E —>u(p7+(#)) is uniquely determined by uj, vg,---,u, and ø. It is 
natural to denote this observable associated with Z by o(u,, Ug, +--+, Un). 


NOTES ON CHAPTER I 


1. For a beautiful modern introduction to classical mechanics, see the 
book of V.I.Arnold, Mathematical Methods of Classical Mechanics, 
Springer-Verlag, New York, 1978. 


CHAPTER II 
PROJECTIVE GEOMETRIES 


1. COMPLEMENTED MODULAR LATTICES 


The point of departure of our discussion of quantum phenomenology is the 
observation that the partially ordered set of experimentally verifiable state- 
ments associated with an atomic system cannot be expected to possess the 
distributivity properties characteristic of the Boolean algebras associated 
with classical systems. The simplest and most interesting of the mathe- 
matical structures that model such systems are the projective geometries, 
namely, the lattices of subspaces of vector spaces. More precisely, let D be 
a division ring, and let V be a vector space of finite dimension n > 2 over D; 
we shall always suppose our vector spaces to be left vector spaces (cf. 
Jacobson [1]). Then #=L(V,D), the lattice of linear subspaces of V 
partially ordered by inclusion, is called the projective geometry of V. This 
chepter will begin with a brief review of the basic properties of projective 
geometries. Although these are too simple to serve as models for realistic 
quantum systems, they already possess many of the fundamental features 
of the more complex systems. 

The first example of an abstract projective geometry that anyone comes 
across is the projective plane, described axiomatically in terms of its points, 
lines, and their incidence properties. To define higher dimensional pro- 
jective spaces one replaces incidence by partial order and uses a dimension 
function to characterize the hierarchies of points, lines, planes, and so on; 
the properties of incidence that are characteristic of a projective geometry 
are then summarized by certain natural properties of the dimension 
function. 

We recall that a lattice Z is complemented if for any ae& there is a 
bef such that aAb=0 and avb=1. A complemented lattice Z is modular if 


(1) a,b,ceL, c<a>an(bVc)=(anb)Ve. 


Boolean algebras are modular; for any division ring D and any finite 

dimensional vector space V over D, #(V,D) is a modular lattice which is 

not a Boolean algebra if dim(V) > 2. We next introduce the notion of a 

lattice of finite rank. A chain in a lattice is a sequence (a,), <:<, of elements a, 

of the lattice such that a,<a,<...<a@,, with a, #0 and a,4a, for i<j; nis 

called the length of the chain, and the rank of the lattice is the supremum of 
18 
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the lengths of all possible chains. An element of a lattice is called a point if 
it is nonzero and minimal. If z4, £2, ...,&„ are points, they are called inde- 
pendent if x,+\Vj; 4+; for i=1,2,...,n. The lattice 7(V,D) has rank dim(V), 
its points are the one-dimensional subspaces, and independence is just the 
usual linear independence. For any lattice Y and any a0 in Y, let us 
write £[0,a] for the set of all elements b of Z with 0<b<a; under the 
partial ordering inherited from #, #[0,a] is also a lattice. If X is of finite 
rank, so is #[0,a]; and we refer to its rank as the dimension, dim(a), of a. 
We use the convention that dim(0) =0. 

If # is a complemented modular lattice of finite rank n, it is not difficult 
to show that the function a->dim(a) has the following properties: 


(i) dim(0)=0, dim(a) is an integer >0 for all a; 
(ii) ifa<b, dim(a) <dim(b), with equality only for a =b; 
(2) (iii) dim(avb)+dim(aAb) =dim(a) +dim(d) for all a,b; 
(iv) if s=dim(a)>0, then s is the maximum number of independent 
points contained in a; and a family of independent points 
contained in a has sum equal to a if and only if it has s elements. 


In particular dim(1) =n and dim(x) =1 for any point x. The converse is also 
true; if # is a complemented lattice of finite rank admitting a function 
d:£->R satisfying (i)-(iii) of (2) and such that d(x)=1 for all points x, 
then # is modular and d(a) =dim(a) for all a. 

The lattice “(V,D) is modular, dim being the usual linear space 
dimension. At the other extreme is the Boolean algebra of all subsets of a 
finite set, dim being the cardinality. To single out the geometries we need 
the notion of irreducibility. An element of a complemented modular lattice 
L is called central if it has a unique complement in Y, which is written asa’: 
the reason for this terminology is the easily proved fact that a e X is central 


if and only if for all x e L 
(3) x = (x na) V (x na’). 


The set of all central elements is called the center of Z ; itis a Boolean algebra, 
which coincides with Z only when Ẹ itself is a Boolean algebra. Z is called 
erreducible if 0 and 1 are its only central elements. A geometry is an irreducible 
complemented modular lattice of finite rank. The lattices 


L(V,D) (dim(V) 2 2) 
are geometries. 

Every complemented modular lattice of finite rank may be viewed in a 
canonical manner as a direct sum of geometries over its centre. More 
precisely, let Y be a complemented modular lattice of finite rank and let 
C,,+.-,€; be the minimal (nonzero) elements of the center of Z. Let 


L, = £[0,¢,] and let P= LyX i xX Ly 
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where the partial order in # is defined by saying that 


(Xis Er) < (Yis Yu) 


if and only if x; <y; for 1 <t <k. Then each Z, is a geometry, and the map 
that takes xe [Į to _ 
(LAC, LA Co, ooe LACE EL 


is an isomorphism of Z with Z, the inverse map taking (a,,...,2,) to 
at,V...Vt,. For arbitrary geometries £;, Z defined as above is a comple- 
mented modular lattice whose center has (0,...0,1,0,...,0) (1 in the jth 
place) as its minimal elements (1 <j < k). 

The usual geometrical terminology with which we are familiar may be 
introduced as follows. Let Y be a complemented modular lattice of finite 
rank. If ae¥Y and dim(a)=1, a is a point; if dim(a)=2, a is a line; if 
dim(a) =3, ais a plane. Ifa=bvc where b and care distinct points, a is called 
the line joining these points, and b, c are said to be on a; if d is a point <a, 
b, c, d are said to be collinear. It can be proved that £ is a geometry if and 
only if every line has at least three points on it. 

The fundamental theorem of classical geometry is that every geometry of 
rank 24 is isomorphic to some X (V,D) wher the division ring D is deter- 
mined uniquely up to isomorphism; and that if the rank is 3, this is true if 
(and only if) the plane in question is Desarguesian. We shall discuss this 
question a little later. At this time we shall take a closer look at the 
geometries #(V,D). 


2. ISOMORPHISMS OF PROJECTIVE GEOMETRIES. 
SEMILINEAR TRANSFORMATIONS. 


The first problem that arises naturally is to obtain a description of all 
possible isomorphisms between “&(V,D) and ¥(V’,D’). We suppose the 
vector spaces to be always finite dimensional and that the scalars act from 
the left. To formulate the answer we need the concept of semilinear trans- 
formations. Let D, D’ be two division rings which are isomorphic and let 


(4) a: C> C7 (ceD) 


be an isomorphism of D onto D’. Let V (resp. V’) be a vector space over D 
(resp. D’). A map L (v-> Lv) of V into V’ is said to be semilinear relative to 


o or o-linear if 
(5) Lv, tv) = Lw, + Lv, (vis vE V), 
L(cv) = ce Iw (ceD, ve V). 


If L is in addition a bijection of V onto V’, we say that L is a o-linear 
isomorphism. 
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If L is a o-linear isomorphism of V with V’, then for any linear subspace 
M c J, its image L[M] under L is a linear subspace of V’, and M -> L[M] 
is an isomorphism of Y(V,D) with #(V',D’), denoted by éz. We can now 
formulate the main result. 


Theorem 2.1. There exists an isomorphism between L(V ,D) and #(V’',D’) 
if and only if D and D' are isomorphic and dim(V)=dim(V’). If £ is an iso- 
morphism of L(V ,D) with #(V',D’), there is an isomorphism o(D > D’) and 
a o-linear isomorphism L(V > V’) such that =EL. If t(D 5 D’) is another 
isomorphism and L'(V > V’) a7-linear isomorphism, then €p=€ 1; if and only 
if for some deD, d # 0, we have 


(6) cT = derd- (ceD), L'v = dlw (veV). 


For the proof we refer to Baer [1]. If D=D’ it makes sense to call an 
isomorphism E(X (V,D) > L(V,D’)) linear if there is a linear isomorphism 
L(V + V’) such that £= éz. Theorem 2.1 leads to 


Theorem 2.2. In order that every isomorphism of £(V,D) with Z(V',D) be 
linear it is necessary and sufficient that every automorphism of D be inner. 


The division rings of greatest importance in physics are R, the field of real 
numbers, C, the field of complex numbers, and H, the division ring of 
“Hamiltonian” quaternions. 


(a) R. The identity map of R is its only automorphism, i.e., R is rigid. So 
every isomorphism between real projective geometries is linear and the linear 
map is determined up to a multiplicative constant. 

(b) C. One knows from the theory of algebraic fields that there are 
infinitely many automorphisms (cf. Bourbaki [1], pp. 114-115). However, 
the identity and complex conjugation (c-> c*) are the only analytically well- 
behaved ones (e.g., measurable, bounded, etc.). Since Cis commutative, any 
isomorphism between two complex geometries determines uniquely the 
automorphism of C associated with it. 

(c) H, the quaternions. Let 


= (19) = ( °) S o 1) > [i 
— 0 17? Jı ~” 0 —4)’ jo = —1 Qo} Js =|; OQ}? 


where 7 is the usual square root of —1 in the complex number field. We 
define H to be the R-linear span of l and ja (1<a<8); since ja = —] 
(1 <a <3) and jajo= —joja=j. whenever (abc) is an even permutation of 
(123), His an (associative) algebra. It is easy to verify that it is the algebra 
generated by j, with the above relations. Let 


lq| = +det(q)? = + (q2 +92 +92 +q)? 


where q = qol +4151 + Gein + 43) (Yo. Ya E R). Then |:| is a norm on H which is 
multiplicative, i.e., |qq’|=|q||q’|, and q is invertible in H if and only if 
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q + 0. Thus H is a division ring. For qe H, say q=gol + Dic<e<sYaja, its 
conjugate q* is the quaternion gol—Dii<a<s Jaja; the map q> q* is an 
involutive anti-automorphism of H. A quaternion of norm 1 is called a unit 
quaternion. A quaternion is a unit quaternion if and only if its matrix is 


(ior — 92—193 
G—"%3 74 
so that the unit quaternions constitute the group SU(2,(C) of 2 x 2 unitary 
matrices of determinant 1. Two quaternions are said to be in the same class 
if they can be transformed into each other by an inner automorphism; for 
q, q’ to be in the same class it is necessary and sufficient that 


), d thu? +927 +95" = 1, 


tr(q) = tr(q’) (tr = trace) and |q| = |q]. 


All the automorphisms of H are inner so that isomorphisms of quaternionic 
projective geometries are linear. The map x—>zxl (xeR) imbeds R as the 
center of H. 

H may be viewed as a vector space over C, although in a noncanonical 
manner. We identify C as a subfield of H by the correspondence 


Ay +10, > ao + 44 )y3 


H is then a vector space over C, scalar multiplication being from the left. 
Then dimc(H)=2, and {1,j,} may be taken as a basis. For any qeH, 
R,:q' > q’q* is then an endomorphism of H over C, q>R, is a faithful 
representation, and the matrix of j, (1 <a <3) iS eqja, where e,= — 1 for 
a=1,3and +1 fora=2. 


3. DUALITIES AND POLARITIES 


We shall now examine the anti-automorphisms and orthocomple- 
mentations of the projective geometries Y(V,D). It will turn out that 
orthocomplementations essentially arise only from Hilbert space structures, 
at least when D is one of R, C, and H. 

We begin with the concept of the dual of a division ring. Let D be a 
division ring. We define D° to be the ring with the same elements as D, with 
the same rule for addition, but reversing the order of multiplication. D° is a 
division ring, said to be dual to D. Isomorphisms of D, with D,° may be 
viewed as anti-isomorphisms of D, with D,. If V is a vector space over D, 
its dual, defined as the space of D-linear maps of V into D, becomes a vector 
space over D° if we define scalar multiplication by writing 


(a-f)(v) =f(vja (aeD®, veV, feV*); 


addition in V* is defined in the usual way. V* is then a (left) vector space 
over D° and has the same dimension as V. Clearly (D°)®=D and so V can 
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be canonically identified with V**. If (v,,...,v,) is a basis of V, the corre- 
sponding dual basis (v,*, ...,v,*) of V* is defined by v,*(v;,) =5;,. If M = V 
is a linear subspace, its annihilator is the linear subspace of V* defined by 


(7) M? ={f:feV*,f(v) =0 forall ve M}. 
It is clear that 
(8) dim(M)+dim( M?) = dim(V) 


and so M =M”. The map M -> M’ is obviously an inclusion-reversing bi- 
jection of 7(V,D) with 7(V*,D°). We call it a duality, and use the same term 
to refer to any inclusion-reversing bijection of one geometry onto another. 

To determine the most general duality we need the concept of a non- 
singular semibilinear form. Let 0 be an anti-automorphism of D, V a vector 
space over D. A 0-bilinear form is a map <. ,.X:x, y > (x,y) of V x V into D 
with the following properties: 


(9) (i) (titza yY) = (2,4) + (224) 


&, Yı tY) = (£Y) + (T:Y2) 
(ii) (cæ, dy} = c{x,y) d? (c,deD, x,yeV). 


(x, Y, Xi» Yi» Ta, Yo E V); 


We call <. ,.ẹ nonsingular if 


(10r) (x,y =0 VaeVoy=0 
and 
(101) (x,y =0 YyeV>ə>z=Q. 


For any anti-automorphism ð of D, the map 


(Lis eee En)s (Yis Yn) LY + e FH EnYn? 
is a nonsingular -bilinear form on D” x D”. 


Theorem 2.3. Let V be a vector space of dimension n over D; 0, an anti- 

automorphism of D; and ¢.,.), a 0-bilinear form on V x V. Then «.,.) is 
nonsingular if and only if it satisfies either of (101) or (10r). For any linear 
subspace M of V let 
(11) M' = {u:ue V, (x,u = 0 for all x e M}. 
Then, if {.,.) is nonsingular, M > M’ is a duality of L(V ,D). If n> 3, every 
duality of L(V ,D) arises in this manner, for suitable 8 and <. ,.}. The pair 
0', l. ,. X induces the same duality as 8, <. , .} if and only if there is a nonzero 
deD such that for all x, ye V, ceD, 


(12) (x, yy = (x, y)d, ch = d-côd. 
In particular, if D is commutative, 0’ = 8. 


Proof. For ye V let te V* be defined by t,(u)=(u,y) (ueV). Then 
t (y->t,) is a 0-linear map of V into V*. Clearly (10r) is equivalent to the 
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statement that t is injective while (101) is equivalent to the statement that 
(0) is the annihilator in V of the range of t; i.e., that t is surjective. The first 
statement is now clear. Suppose now that (.,.) is nonsingular. Then ¢ is 
a 0-linear isomorphism and so, as M’=i-1(M°) we see that M -> M’ is a 
duality of X (V ,D). Conversely, if £ is a duality of 2(V,D), n: M —> (€-*(M))° 
is an isomorphism of #(V,D) onto £(V*,D°). This shows already that 


(13) dim(M)+dim(é(M)) = n. 


Moreover, theorem 2.1 gives the existence of an anti-automorphism 6 of D 
and a 6-linear isomorphism t of V with V* such that M°=t[é(1/)] for all M 
in £(V,D). If we set (x,y) = (ty)(x) (x,y e V), it is immediate that (.,.) isa 
nonsingular @-bilinear form and ¢(M)=M’ for all M. If @’ and ¢.,.)’ is 
another pair and t,'(x)=(x,y)’ (x,y e V), 0, ¢.,.)’ also give rise to é if and 
only if t and t’ induce the same isomorphism of Y(V,D) with #(V*,D°). 
Theorem 2.1 now leads to the required result. This completes the proof of 
theorem 2.3. 

A duality £ of #(V,D) which is involutive is known as a polarity; this 
means M = £(£(M)) for all M. A polarity is called isotropic if M S (M) for 
all one-dimensional M. 


Theorem 2.4. Let V be a vector space of dimension n>3 over D. Then 
L(V,D) admits an isotropic polarity if and only if D is commutative and 
dim(V) is even. In this case, if £ is an isotropic polarity and 2N =dim(V), we 
can find a nonsingular skew-symmetric bilinear form {. , .) such that &(M) = M' 
for all M. Moreover, we can find a basis {2,Y},--.,ey, Yn} for V such that 
(tix = (Yiyi =9, (Ys) = — (YX) =d (1<t,j <N). In particular, if £ 
and £ are two isotropic polarities of £(V,D), there is a linear automorphism a 
of £(V,D) such that € =afa™. 


Proof. Let £ be an isotropic polarity of #(V,D). By theorem 2.3 there 
exists an anti-automorphism @ of D and a nonsingular 6-bilinear form 
{.,-) giving rise to £. Since £ is isotropic (z,x)=0 for all xe V. Replacing 
x by x+y we see that (.,.) is skew symmetric. If now ceD, 


(x, yc? = —c (y, x) = c (x,y); 


choosing v, y such that (z,y)=1 we see that 0 is the identity. This means 
that D is commutative and <. ,. Y is bilinear. The remaining statements are 
standard. Theorem 2.4 is proved. 

We now consider nonisotropic polarities. A 8-bilinear form <. , .) is called 
symmetric if (x,y)? =(y,x} for all x,ye V. Notice that this property is 
relative to 0. We observe that if (. , .) is not identically zero, is necessarily 
involutive; then the set of values of (.,.}) is all of D, and 


(x,y) = (yr)? = (x,y). 
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Lemma 2.5. Let dim(V)>2, <. ,. X a nonsingular 0-bilinear form such that 
(xx) = (2,29 #0 for some xe V. Suppose {.,.) induces a polarity. Then 
this polarity is nonisotropic, 0 is involutive, and {., .) is symmetric. 


Proof. Ifu,veV, (u,v) =0<D-v ¢ (D-u)’ + (D-u) S (D-v)' = (v,u) =0. 
Let c=(x,x). Then x ¢(D-2x)’ and dim((D-z)’)=n—1, so that 


V = (D-2z) @ (D-2)’. 


If u,ve(D-2)', ((u,v)e—u,x+cev)=0, so that (x+cv,(u,v)x—u)=0, 
proving that (u,v)9=(v,u). Now ¢.,.) is not identically zero on 


(D.2)’ x (D.2)’, 


and so this already proves that @ is involutive. The symmetry of <. ,.} on 
V x V now follows from direct computation. Lemma 2.5 is proved. 


Theorem 2.6. Let 0 be an involutive anti-automorphism of D, V a vector 
space of dimension n over D, and {.,.) a nonsingular symmetric 6-bilinear 
form on V xV. Then the duality corresponding to {.,.}) is a nonisotropic 
polarity unless D is commutative and of characteristic 2. Conversely, let n>3 
and let € be a nonisotropic polarity of L(V D). Then there exists an involutive 
anti-automorphism 0 of D and a nonsingular symmetric 0-bilinear form 
«.,.)0n V x V such that £ 1s induced by <. ,.). If 8’ andl., .)’ is another pair, 
é is also induced by them if and only if there is ad 40 in D such that for all 
z,yeV,ceD 

(x,y = (x,y) d, c?” = d-1¢5d, 
In this case d? = d. 


Proof. Given 6 and ¢., .), the corresponding duality is a polarity. If this 
were isotropic we can argue as in theorem 2.4 to conclude that D is com- 
mutative, @=identity, and <. , . X is skew symmetric. As <. , . X is symmetric 
it follows that D has characteristic 2. Conversely, let n>3 and let ¢ be a 
nonisotropic polarity of (V,D). Then £ corresponds to a pair bp, ¢.,.). 
Choose xe V such that (x,z}o=c#0. Writing 


(U,V) = (U,V) 9 Cg}; c? = Cocho}, 


we see that é is also induced by 0, <.,.}, and that (2,2)=1. Lemma 2.5 
shows that @ is involutive, and <. ,.} is symmetric; clearly, 6, <. ,. Y induce 
£. Suppose 6’ is an anti-automorphism of D and {.,.)’ a symmetric 
6’-bilinear form. Then, by theorem 2.3, 0’, ¢.,.)’ induce £ if and only if 
for some d40 in D, (x,y)'=(a,y)d, (x, ye V) and c?” =d-'c4d for all ceD. 
If (x,y) = 1, then (y,x)=1 and so ((a,y)')® =(y,2)’ =d while 


((x,y)')? =d" =dd°d, 


also. So we have d? =d. 
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Remark. It is useful to note that the form <., .) inducing £ has the prop- 
erty that (w,w)=1 for some we V. 


4, ORTHOCOMPLEMENTATIONS AND 
HILBERT SPACE STRUCTURES 


Anorthocomplementation of £ (V ,D) is a polarity £ such that M ^ €(M)=0 
for all M, i.e., an inclusion reversing bijection £ of #(V,D) with itself such 
that €(€(M/)) = M and M A^ €(M)=0 for all M. It is easy to see that a polarity 
is an orthocomplementation if and only if (D-x) Né(D-2z)=0 for all ze V. 
The preceding discussion leads to 


Theorem 2.7 (Birkhoff-von Neumann [1]). Let n=dim(V)>3 and E 
an orthocomplementation of £(V,D). Then there is an involutive anti- 
automorphism 0 of D and a nonsingular symmetric 0-bilinear form «.,.) on 
V x V such that 6 and <. ,.} induce £. We have (x,x)=0 <> 2=0, and ¢.,.) 
can be chosen so that (w,w) =1 for some we V. 

Let a symmetric 9-bilinear form <. ,.} be called definite if 


(i) (v2) =0 + x=0, 


14 
(14) (ii) <w,w}=1 forsome wel. 


A definite form is necessarily nonsingular. Assume now that D is one of 
R, C, or H. 


(a) D = R. Then the definite forms are precisely the positive definite 
quadratic forms. So the definite form ¢.,.) of theorem 2.7 defines a scalar 
product and é is the orthocomplementation in the corresponding Hilbert 
space structure. 

(b) D = C. Here we must assume that @ is continuous or measurable 
(analytically well behaved). Since complex bilinear forms are not definite, 
0 must be the complex conjugation.The definite forms are thus the Hermitian 
positive definite forms, and in theorem 2.7, é is the orthocomplementation 
defined by the Hilbert space structure corresponding to the scalar product 
(.,+) 

(c) D = H, the quaternions. A scalar product for a vector space V over H 
is a *-bilinear form on V x V such that: (i) for any x—> V, (xx is in the 
center of H and coincides with a nonnegative real number; (ii) (7,2) =0 if 
and only ifx=0; if ||x|| = + (a,x), ||: || is a norm for V. If V is complete under 
l-l}, it is called a Hilbert space; this is the case when dim(V)<oo (cf. 
Finkelstein and others [1]). Let us now consider #and <. , .) asin theorem 2.7. 
Then we can find a unit quaternion q, such that c9=q)c*q,—! for all ce H. 
Since @ is involutive, q? lies in the center of H and so q= +1. 


We shall now show that if q= — 1 there are vectors 40 with (2,2) =0. 
In fact, we can then write qo = qo j,4o~? where qy is a unit quaternion. Let 


PROJECTIVE GEOMETRIES 27 


ue V be such that (u,u) is a unit quaternion, say a, and let wef (H- x) be 
such that (7,7) is also a unit quaternion, say b; of course V=H:-u@é(H- u). 
From (u,u)9 = {u,u) we get, after a small calculation, 


'-lg ‘_ Cc r) 
To ao —r* ¢ 


where cER, |@|+|r]?=1. Hence tr(aqo)=0. Similarly, tr(bq,)=0. So 
aq, and — bq are in the same class, proving that qaqoqı*qo 1 +b=0 for 
some unit quaternion q,. But then (x x} =0 for x=q,u+v. So, q= +1 is 
the only possibility. Since det(q,)=1, we must have q, = + 1. So 8 must be 
conjugation and <.,.) is *-bilinear. In particular, (u,u is real for all 
ueV,and >0 for ue V\(0). V thus becomes a Hilbert space under (.,.), 
and é is the associated orthocomplementation. We thus have 


Theorem 2.8. Let D=R or H and let V be a vector space of dimension 
n>3 over D. If £ is any orthocomplementation in £(V,D), there is a scalar 
product for «.,.) converting V into a Hilbert space such that £ is the corre- 
sponding orthocomplementation. If D =C and £ is regular in the sense that the 
anti-automorphism of C defined by £ is continuous, then it is the complex 
conjugation, and £ is induced by a Hermitian scalar product <. ,.Y that con- 
verts V into a Hilbert space over C. Finally (.,.) is determined by £ up to 
multiplication by a positive real number. 


We shall now assume that D is one of R, C, or H, and that V is a Hilbert 


space of dimension n > 3 over D, with scalar product ¢.,.). 


Theorem 2.9. Suppose £ is an automorphism of £(V,D) which preserves 
the orthocomplementation of V. Then there exists a semilinear automorphism 
Lof V inducing £ such that: 


(i) «tfD=RorH, Lis linear, and for D =C, L is either linear or conjugate 


linear; 
(ii) forallz,yeV, 
(15) (La, Ly) = (x,y) 
if L is linear, and 
(16) (Le,Ly) =(y,%) 


if D= Cand Lis conjugate linear. In particular, L is an isometry of V. 


Proof. We begin with the following observation.Let p be an automorphism 
of D and L, a g-linear isomorphism of V onto itself such that the induced 
automorphism of £(V,D) preserves the orthocomplementation of V; then, 
for some d #0 in D, we have, for all a, ye V, 


(17) (L,x,Lyy) = (u,y)?d 
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Indeed, (2,y) =0 if and only if (L,2,L,y) =0; so, if we define 


(xy) = (Lix, Ly)?  (x,ye V) and p=9-l0y 


where 6 is the anti-automorphism corresponding to ¢.,.), then the pair 
W, {.,.)~ also induces the orthocomplementation in (V,D). Theorem 2.3 
now leads to (17). This said, let us first suppose that £ is induced by a 
linear automorphism L, of V. Then (17) becomes (L x, Liy) =(2,y) d. This 
shows that d is a real positive number, and we get the required result on 
taking L = d—-1L,. If £ is not linear, then D = C and there is an automorphism 
g of C which is not the identity and a 9-linear isomorphism L of V with 
itself, such that L, induces £. In (17) we take x =y =x, where x is such that 
(£o) =1, so that d is a positive number. Then, taking x=y in (17) shows 
that (a,7)? is real and >0 for all xe V, so that r? >0 for all real numbers 
>0. In other words R? = R, which implies y= identity on R. As 9 is not the 
identity, 1” = — i and so ọ is complex conjugation. If L=d-1L,, we get (16). 
This proves theorem 2.9. 

When D = Cand Lis a conjugate linear isomorphism of V with itself such 
that (16) is satisfied, we call L anti-unitary. Of course, if L is linear and 
satisfies (15), L is unitary. 


5. COORDINATES IN PROJECTIVE AND 
GENERALIZED GEOMETRIES 


The study of the projective geometries &(V,D) gains significance in 
view of the fundamental theorem of classical geometry which asserts that 
these are the only geometries, at least when the rank is > 4. When the rank 
is 3, i.e., when we are dealing with a projective plane, the geometries 
£(V,D) are characterized by the property of being Desarguesian, namely, 
that the classical theorem of Desargues is true for them. Of course a pro- 
jective plane imbedded in a geometry of higher rank is always Desarguesian; 
this can be proved directly and in a simple manner. 

However, as was mentioned at the beginning of the chapter, the geo- 
metries (V ,D) and, more generally, lattices of finite rank, are not adequate 
to serve as models for the proposition calculi of complex quantum 
systems. We shall therefore enlarge the scope of our discussion by including 
a class of partially ordered sets of infinite rank, and proving for them a 
coordinization theorem that will include the classical result. The objects that 
we shall study will be called generalized geometries; and the main result 
concerning them is that they are isomorphic to the partially ordered sets of 
finite dimensional subspaces of possibly infinite dimensional vector spaces 
over division rings. 
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Let £ be a set with partial ordering <; we do not assume that L has a 
unit element. £ is called a generalized geometry if the following conditions 
are satisfied: 


(i) for any finite subset F of Z, V..74, Nep, exist in Z, 
(18) 
(ii) ifaeY and a0, £[0,a] is a geometry. 


Thus # is a geometry if and only if it has a unit element. If D is a division 
ring and V a not necessarily finite dimensional vector space over D, the 
partially ordered set “(V,D) of finite dimensional subspaces of V is a 
generalized geometry, which is a geometry if and only if dim(V) <œ. If Z 
is a generalized geometry, # has points and every element of # is a sum of 
points. The dimension functions of the #[0,a] are mutually compatible and 
define a dimension function on all of Z. We write dim() for 


Sup, e (dim (a)). 


We can obviously speak of lines, planes, etc., in Y. In the rest of this section 
we shall sketch a proof that any generalized geometry is isomorphic to some 
L(V ,D), provided dim(/) > 4. Our proof is a straightforward variant of the 
classical one; but we have decided to include a sketch since it is not easy to 
locate it in the literature in the precise form we need. The reader who wants 
more details on the part dealing with Desarguesian planes should consult 
Seidenberg [1] and Heyting [1] (see the Notes at the end of this chapter). 
Let Z be a generalized geometry with dim() > 4. Then all the planes in 
-Y are Desarguesian. The classical method of constructing the division rings 
associated to Z is as follows. We take a line t in Z and a plane a in Z 
containing t, and we choose three distinct points O, E, W ont. To this data 
we shall associate a division ring D whose elements will be the points of the 
line ¢ that are different from W. The operations of addition and multipli- 
cation in D will be defined with the help of geometrical constructions (in the 
plane a) that imitate the familiar constructions in the Euclidean plane. 


Addition. Let A eD; then we define the map a4:B>B+A of D with 
itself in the following manner. In the plane a we take distinct lines q, r 
through W, different from t, and distinct points X, Y on q, different from W, 
such that if m(X; r,t) (resp. n(Y; t,r) is the perspectivity from r to t with 
center X (resp. from / to r with center Y), then 7(X; r,t) 7(Y; t,r) takes 0 
to A; then we define (Fig. 1) 


aa =n(X;r, t) n(Y;t,r). 


If we view q as the “‘line at infinity ” then this construction resembles the 
usual Euclidean one (Fig. 1(a)). 
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Fra. 1(a) 
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Multiplication. Given A eD we define right multiplication ma by A; if 
A=0,m,sB=0 by definition for all B and so we take A #0. We take in the 
plane a a line g through W different from t, and a line p through O, different 
from t. We choose distinct points X, Y on q different from W such that 
m(X; p,t)7(Y; t,p) takes E to A (see Fig. 2); then 


ma = 1(X; p,t) n(Y; t,p). 


If we view q as the “line at infinity ”, this resembles the usual construction 
in the Euclidean plane (Fig. 2(a)). Of course we could have equally well 
taken this as the definition of A - B; our choice is dictated by the requirement 
of getting isomorphisms with the geometry of subspaces of a left vector 
space. 

Although these definitions involve the choices of auxiliary lines and 
points, the Desarguesian nature of the plane can be used to prove that 
addition and multiplication are well defined and convert D into a division 
ring with O and Æ as its null and unit elements. Ift, O, E, W are fixed, this 
division ring is independent of the choice of the plane a that contains ¢; for, 
ifb is a different plane containing t, and X is a point in the three-dimensional 
space aVb which does not lie on a or b, the perspectivity from X gives an 


Fra. 2 
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Fig. 2(a) 


isomorphism of #[0,a] with [0,6] which is the identity on t=a^b, and it 
is clear that addition and multiplication on D are the same whether we use 
a or b. The division ring is also unchanged up to isomorphism if we change t 
O, E, W. More precisely, let t’ be another line in the plane a, and O’, E’, W’, 
three distinct points on t’. Then any projectivity in the plane that takes t to 
t' and O, E, W respectively to O’, E', W’, isan isomorphism of the associated 
division rings; this follows from the well-known fact that any projectivity 
of t with ¢’ extends to an automorphism of the plane #[0,a]. We remark 
finally that D is commutative if and only if the geometry of #[0,a] is 
Pappian for any plane a in Z, i.e., the theorem of Pappus is valid in 2[0,qa] 
for any plane a. 

The first step in the coordinatization of # is to define“ affine” coordinates 
for all points of # that do not “lie at infinity”. If Z is a geometry we can 
choose an arbitrary hyperplane to be at infinity. For a generalized geometry 
we use a simple modification of this procedure. To obtain an intuitive picture 
of the definition of coordinates the reader is asked to picture everything in a 
Euclidean context, interpreting incidences at infinity in terms of parallellism. 

Let us call a set of points of 2 independent if every finite subset of the set 
is independent. If{P,}(j¢J) is an independent set of points, then we have a 
map u (K->u(K)) from finite subsets of J into ¥ defined by 


(19) u(K) = VP» u($)= 0. 
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It is easy to verify that 
(20) u(K, Y Ko) = u(K,) V (Ka), u(y A Ka) = u( Ky) Au(K,). 


An independent set C of points of # is called a basis if it is maximal. Every 
independent set is contained in a basis in view of Zorn’s lemma. If C is a 
basis of Z and P is a point of Y, it follows from the maximality of C that 
there is a finite set F €C such that P< Vege pQ; hence for any aeY, there 
is a finite set G cC witha < Voce Q. Let O be a point of X. By a frame at O 
we mean a pair (O,{P;},.,) such that {O,{P;},-,} is a basis for Z. There 
are frames at any point of Z. 

From now on we fix a frame (0, {P;},-,) at O. An element ae is said 
to lie at infinity if there is a finite set K <J such that a<u(K) (ef. (19)). 
It follows from (20) that the elements of # that lie at infinity form a 
generalized geometry, denoted by L. If ac and a€#,, there is a 
unique a,,¢€%,, such that ay <a and a,, contains all elements of S [0,a] 
that lie at infinity; a,, is the unique element at infinity contained in a with 
dimension equal to dim(a)—1. In fact, let KJ be a finite set with 
a<Ovu(K); asa+u(K), dim(avu(K))>dim(u(K)) so that 


avu(K) =Ovu(K); 


if a,, = aAu(K), then dim(a,,) = dim(a)—1, from which the characteriza- 
tions of a,, follow immediately. The uniqueness of a,, is obvious and a,, is 
called the hyperplane at inifinity in a. In particular, if P is a point not in 
Land KEJ a finite set, then for a= Pvu(K), u(K) is the hyperplane at 
infinity. Every line not in #,, has a unique point at infinity on it, every 
plane not in #,, has a unique line at infinity on it, and so on. 

Let m, be the line OVP; (je J). For each jeJ we choose a point E, in m; 
distinct.from O and P,. As explained earlier we convert the set of points of 
m; different from P; into a division ring D, with O and Æ; as its null and 
unit elements, respectively. The division rings D; are of course mutually 
isomorphic but we can do better and introduce a ‘‘ compatible” system of 
isomorphisms. Let j, keJ, j # k. The lines H;VE, and P;V Pp, being in the 
plane m,;Vm,, meet at some point, say P;,; this is the point at infinity of 
E,VE,, and the perspectivity from P,,, of the line m, onto the line m,, 
fixes O and takes E, (resp. P,) to E, (resp. P;). It thus induces an iso- 
morphism of D, onto D;, which we denote by 


(21) Oi: DID, Oir = 7(P5x3 Me, Mj). 


We define 0,; to be the identity. Then the “compatibility ” of the 6,, may 
be formulated as the following lemma. 


Lemma 2.10. If j, k, reJ, then 
(22) 8,5 ° Dix = Orr- 
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Proof. From (21) it is immediate that 6,;=0; he Suppose now that 
j, k, r are three distinct indices from J, X e D,, Y=6;,(X), 2=0,,(Y). We 
must prove that Z=6,,(X). We may assume X40, X#E, Desargue’s 
theorem applied to the triangles X YZ and £,£,#, which are in perspective 
from O, gives a line t on which the points 


(XV Y)A (E; V E) = Pir (Y v Z) (E; V E,) = Pi, 


and (XVZ)A(E,V E,) =Q 


lie. Obviously ¢ is at infinity and so Q = P,,, proving that Z=6,,(X). 
From (22) we see that there is a division ring D and isomorphisms 


(23) ġ: DSD, 
such that the diagrams P: 

D œS D, 
24 
( PND 79:2 


are commutative. We write O and £ for the null and unit elements of D. 

Let H be the set of points of Z which do not lie at infinity. Let us denote 
by W the vector space of all functions with finite supports on J with values 
in D. W is a left vector space over D in the usual fashion. Suppose now 
Pe. We can choose a finite set K c J such that P< Ovu(K). ForanyjeJ, 
u(K —{j}) is the hyperplane at infinity on Pvu(K —{9}) and so the latter 
does not contain P,; thus 


Mx? = m; A (Pv u(K —{)})) 
is an element of D,. If K, € K is such that P < Ovu(K,), it is obvious that 
M,x?=M,x,"; if LEK is finite with P<Ovu(L), then P< Ovu(K OL) 
by (20), and so M; x? =M; 1” =M kar. We thus write, for any jeJ, 


(25) M,P=m,A(PV u(K—-{3})) (P < Ov u(K)). 


Transferring M ;, to D now gives the coordinates of P, i.e., let fp be the 
function from J to D defined by 


(26) fp (j) = g (M;") (jeJ). 


Lemma 2.11. For any P ef, fp is in W; fo=0, the origin of W; and P —>fp 
is a bijection of P with W. 


Proof. If P <Ovu(K) and j¢K, M,” =m;a(Pvu(K))=0, showing that 
fpe W. It is also obvious that fp =0. Let ge W, 249, and let J’={j,,...,Js} 
be the support of g. Let Q,=9;,(8(jr)), ar =Q, Vud" —{j}), P= MS <Ar- 
A simple dimension calculation shows that P is a point. If P were to lie at 
infinity, P <u(J’—{j,}) for all r, which is impossible since 


A uJ" —{9r}) = 0 


1Sr ss 
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by (20). So, Pe and itis easy to verify that fp =g. If P’ e is also suchthat 
f,,=g, it is clear that P’ <Ovu(J'), and hence, for 1<r<s, 


Q,= M,” < P’v uJ" —{9,}) 
so that P’ <a, for all r, showing that P’ =P. 


Remark. For later use we mention here the following easily verified fact: 
let Pe be different from O; R, the point at infinity on the line OvP; and 
jeJ any index such that P+ m,. Then we have a well-defined point S, 
necessarily at infinity, where the line RVP, meets u(K —{j}); and for any 
QEP, Q<OvP, O40, the line QVM, passes through S. 


The map P-fp describes the “affine” geometry of $$ in terms of the 
affine linear geometry of W. The next three lemmas make precise this 
statement, and they constitute the heart of the proof of the main theorem 
of this section. 


Lemma 2.12. Let Pe, P4O. Then a point Q of B lies on the line OVP 
if and only if for some M eD one has 


fo = M ‘fpr. 
Proof. Let K=Kp<J be the smallest finite set €J such that 
P<Ovu(k); 


to avoid trivialities we assume that K has at least two elements. Let Q4O 
be on the line OVP, Qe. As Ko = K, fo(7)4O for all je K and so for each 
je K there is a unique nonzero M,¢D such that M,;2=M,-M,?. We need 
to show that for j, ke K, 74k, 6;,(M,)=M,. If for some je K, M,2°=M,? 
then our remark above shows that P=Q and so we may suppose that 
M,PH#M,?,jEK. Fixj, ke K,j+k. The triangles PM,?M,? andQu,emM,? 
are in perspective from O and so have an axis of perspectivity. This is a line 
at infinity (cf. the remark above) and so 


(M, v M?) A (MVM) =T 


is a point at infinity, which lies on the line P,v P, because all four points are 
in the plane a = OvP,VP,. If m denotes the perspectivity, in the plane 
a=OvP,VP,, of m;tom, with center T, we can use the definition of multipli- 
cation in-D,; to conclude that if N,=(0;,07)(H;), then for any UeD,, 
(9;,0m7)(U)=U-N;. Thus 6;,(1f,°)=M,e-N; and 6;,(M@,?)=M,?-N;, 
proving that 0,,(21,,) = M;. 

For the converse, suppose Q # O is in H with fg =M - fp. Assume as before, 
to avoid trivialities, that K has at least two elements. Choose a jpe K and 
let Q’ be the point on the line Ov P such that the lines Q'V M, 2 and PvM,,? 
meet at infinity. Obviously, Q’ef, Q'#0, and M,°=M,°?. Since 
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fo,.= M’-fp for some M’eD, we have 9,,(M')=9,,(M) so that M'=M, 
i.e., Q=Q’. 

Lemma 2.13 (“parallelogram law of addition”). Let O’e P, O’ +0, and let 
t' be a line through O' not containing O. Let U, R be the points at infinity on 
the lines OVO' and t’, respectively, and let t be the line OV R. If Pe $ lies on 
t and 


(27) P’=(UVP)at, 
then fp = f{p+fo-. 


Conversely, suppose P'e% lies on t'; then fe — fo’ =fp for some P on t, and 
P and P’ are related by (27). 


Proof. For the first part we must show that M,?’=M,?+M,®’ for all 
j¢K where KEJ is a finite set such that 0’, P, P’ are all <Ovu(K). If 
U<Ovu(K—{j}), it is easy to see that M,?’=M,? and M,” =0; if 
R < Ovu(K —{J}), it is likewise easy to check that M,? =O and M,?’=M,°. 
So assume that neither U nor R lies in Ovu( K —{j}). Then t’A(Ovu(K —{3})) 
is a point, say O”; O" £0, and O”e P. Let U’ be the point at infinity on 
OvO”, and P” and O,, the respective points on OVR where U’vP’ and 
U’vO' meet it (see Fig. 3). 


Fia. 3 


As O” < Ovu(K —{)}) we have U’ <u(K —{9}) and so 
M,” = M,?’, M,” = M;. 


It is clear from Fig. 3 that with respect to addition on the line ¢ (minus È), 
P"=P+0,; so, ift=m,, M,?’=M,?+M,%. If tm, let T be the common 
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point at infinity on the lines O,vM,;%, PvM,?, P’vM,?". Consider the 
plane tvm,, and in it, the perspectivity from t to m; with center 7'. As this 
takes O,, P, P” to M,;%, M;P, and M,?”, respectively, we get 


M ,P” = MP +M’. 


For the converse let g=fp-—fo-. Define P=tA(UvP’). Then P and P’ are 
related by (27) and so g=fp. 


Lemma 2.14, Let P,', P,’, Pz be three distinct points in B. Then they are 
collinear if and only if for some A £0 in D we have 


(28) fp, = A ° fp, + (E — A) ° fp... 


Proof. Assume first that the P; are on a line t’. Then (28) follows from 
lemma 2.12 ift’ contains O. If O +t’, we can apply lemma 2.13 with O’= Py. 
If P, and P, are the points on the line OVR corresponding to P,’ and P,’, 


we have 
fp, = fp, +fp,, fp, = fp, +fp,, 


while lemma 2.12 gives a nonzero A eD with fp,= A-fp,. This implies (28). 
For the converse, assume (28) and let t’ be the line P,’v P,’. We may suppose 
that O +7’, as the case O <t’ is trivial. Write O’ =P,’ and define P, corre- 
sponding to P,'. If P, is the point on the line OvP, such that fp,=A-fp,, 
then fp, =fp, +fp,. Lemma 2.13 now proves that Py <t’. 


We are now in a position to introduce homogeneous coordinates for all 
points of FY, Let J be an index set containing J and one extra element which 
we denote by oo. Let V be the left vector space of all D-valued functions on I 
with finite supports, and Z’, the generalized geometry of all finite dimen- 
sional subspaces of V. For any Pe $ we define gre V and y(P) eZ” by 


and 
(30) AP) = D-gp. 


Suppose P is a point at infinity on Z. We choose a point Q on the line OVP 
such that 04 P, QO and define gp,geV, y(P)eL’ by 


felj) = ged, 
an gra [0 i 
’ J = 0, 
and 
(32) AP) = D- £p.. 


Lemma 2.12 shows that y(P) in (32) is independent of the choice of Q and 
so is well defined. 
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Lemma 2.15. The map P—>y(P) is a bijection, of the set of all points of L 
with the set of all points of L', which preserves collinearity. 


Proof. The bijective nature of the map P>y,(P) is a straightforward 
consequence of the definitions and we sketch the argument for proving 
that y preserves collinearity. Fix three points of Z, say P, (1=1, 2,3); we 
must show that they are collinear in ¥ if and only if y(P,) are collinear in 
L. 

Case 1. P;€$ for all i. In this case lemma 2.14 gives what we want. 

Case 2. Exactly one of the P,, say P}, lies at infinity. Let Q be a point on 
Ov P}, 940, Q+ P}. It is trivial that y(P,), y(P,), and y(P}) are collinear if 
and only if for some M eD, fg=M-:(fp,—fp,). Lemmas 2.12 and 2.13 show 
that this is precisely the condition for Q to lie on the line joining O to the 
point at infinity on P,VP,, i.e., for P, to lie on PVP. 

Case 3. P, and P, are both at infinity. Suppose P,, Pa, P, are collinear, so 
that P, is also at infinity. Chooce Q Ee, Q: #0 (i = 1,2) such that: 
(i) Q,<OvP,; (ti) Qi, Qa and P, are collinear. Choose Qe, Q,40 on 
Ov P}. By lemmas 2,12 and 2.13 fo,=M-(fo,—fe,) for some M eD. This 
shows that y(P;), y(P,), and y(P3;) are collinear. Conversely, if y(P,), y(P.), 
y(P3) are collinear, choose Q; ER, Q; #0 in OvP,, 1=1,2,3. Then for some 
M,N#OinD, 

fg, = M -f9,+N -fg,. 


By lemma 2.12 we can find points Q,’ #0, such that 


fo. = — M :fo,, fo, = N : fa,. 
Then 
for = fo, —fo,. 


If X is the point at infinity on Q,’vQ,’, lemma 2.13 and this relation tell us 
that Q, is on OVX. So X=Po, i.e., the P, (1=1,2,3) are collinear. This 
proves the lemma. 

These lemmas imply the following result which is the fundamental 
theorem on coordinatization of geometries. 


Theorem 2.16. Let Z be a generalized geometry with dim(£)>4. Then 
there exists a division ring D and a left vector space V over D such that L is 
isomorphic to the generalized geometry of all finite dimensional subspaces of 
V. V is finite dimensional if and only if L is a geometry. 


NOTES ON CHAPTER II 


1. For more detail on quaternionic geometry see Chapter I of Chevalley 
[1]. The identification of H with C? in Section 2 gives rise to a bijection 
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H*"~> C2" which is C-linear and associates q= (q1, ---, qn) tO ©=(2j,..., Ven) 
where q,=%,1+2%n4xjo. We may regard H as a Hilbert space with scalar 
product (q,r)=Di<ecnGitx* Where q= (qi, n) and r=(rj,...,r,). The 
group of linear automorphisms of H” that preserves (. , .) goes over to the 
subgroup of U(2n,C) that preserves the symplectic form 

(LY) = 2 (LeYnte—YuEntn)s 


1<kgn 


where x= (2, ..., £an) and y= (Yj, +--+) Yon). 


2. The definitions in Section 5 are classical. For any linet in a Desarguesian 
plane 7 let G be the set of all projectivities of ¢ with itself which are compo- 
sitions of two perspectivities: from ¢ to a second line s followed by a per- 
spectivity from s to t. The line s, as well as the centers of perspectivity, are 
arbitrary. Assume that sé and that the two centers, X, Y are distinct. 
Then the projectivity has one or two fixed points according as the line Xv Y 
passes through s^t or not. In the former case the projectivity is called special 
and the point W, which is on s, t and Xv Y, is the unique fixed point and is 
called its canonical fixed point. If O=sAi and W=(XvY)At are distinct, 
they are the two fixed points and are respectively called the first and 
second canonical fixed points of the projectivity. For any W on t (resp. 
distinct O, W, on t), G,, w (resp. G,, 0, w) is the set consisting of the identity 
and the special (resp. general) projectivities of t with itself, with W as 
canonical fixed point (resp. O and W as the first and second canonical fixed 
points). 

The key fact, which follows from the Desarguesian nature of 7, is that if 

o EG, w (resp. o€G,,0,w), o is determined by its action on one point of t 
other than W (resp. its action on one point of t other than O and W), 
independently of the choices of s, X, Y. It is immediate from this that the 
operators a4 and ma are well defined. The possibility of using different 
s, X, Y to define the same projectivity then allows one to prove that G,. w 
and G,,o,w are groups, the former being abelian. This will prove that + 
converts D into an abelian group with O as its null element, and - converts 
D\0 into a group with Æ as unit element. Right multiplication is of course 
in G,,o, w; left multiplication is also a projectivity of t with itself fixing O and 
W, which however is a composition of three perspectivities. Distributivity 
laws are special cases of the more general result that if a is any projectivity 
of t fixing O and W, a is an automorphism of the additive group of D. This 
is an immediate consequence of the extension principle stating that any 
projectivity from one line of 7 to another can be extended to an auto- 
morphism of 7. This principle, in the case of a perspectivity, is an easy 
application of the following lemma: 


Lemma. Let C be a point of 7; t, t' distinct lines through C; and A, A' 
distinct points on t', different from C. Then there is a unique automorphism a 
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of the plane n such that: (i) a fixes each point of t; (ii) a(A)=A’; (ili) for any 
point X not on t, a(X) is on the line joining C to X. 


Proof. Consider first a line u through C distinct from t and two points 
X, X’ on u, distinct and different from C. Let P(u) be the set of points of the 
plane not on u. We define @x, x’ to be the map P(u) > P(u) such that for any 
B in P(u), B’=@x, x‘(B) is the point where the line Kv X’ meets the line 
CVB, K being the point where the line By X meets t (draw a figure). It is 
easy to see that 0 x, x’ is bijective and its fixed points are precisely those on ¢ 
(except for C). If v is the line Bv B’ for some B in P(u), 62, g’ is well defined 
on P(v), and a suitable application of Desargues’ theorem shows that 
Op p'=Ox, x’ on P(u) A P(v). We now define a as follows. For any point 
P<t,o(P)=P;ifP¢tand P ¢t’,a(P)=6a, (P); if P<t’, wetake X outside 
t and t’ and define a(P)=@x,x(P) where X’=6a, a’(X). It is then not 
difficult to show that a is well defined, defines an automorphism of 7 with 
the required properties, and is the only such. 


3. Asa final reference on the foundational aspects of projective geometry 
we mention the first volume of H. F. Baker’s great six-volume classical 
treatise (H. F. Baker, Principles of Geometry; originally published by the 
Cambridge University Press, and republished in 1968 by Frederick Ungar 
Publishing Co, New York). 


4. To von Neumann we owe the most profound and far reaching of the 
coordinatization theorems for complemented lattices. Motivated by the 
examples of projection lattices arising from operator algebras which are 
factors of type II, he carried out a penetrating analysis of complemented 
modular lattices that are not of finite rank and hence do not have points 
(“ pointless geometries” !). Before doing this the classical theorem of co- 
ordinatization had to be reformulated so that it made no reference to 
points; von Neumann viewed it as the establishing of an isomorphism of 
the geometry with the lattice of right ideals of the ring of N x N matrices 
over some division ring D. In the general case he proved that if # is a 
complemented modular lattice with the property that, for some integer 
N >4, 1 is the sum of independent elements (a,),<;<n any two of which are 
“in perspective” (i.e., have a common complement), then @ is isomorphic 
to the lattice of principal right ideals of a so-called regular ring R, which is 
determined uniquely up to isomorphism. Here, regularity means that for 
any a e R, there is xe R such that axa =a; he actually constructed R as the 
ring of N x N matrices over another regular ring S. It must be remarked 
that, in general, the partially ordered set of right ideals of a ring is not 
complemented, and that if a right ideal A possesses a complementary right 
ideal B (i.e., A+B=R, AOB=(0)), then A and B are both principal; and 
further, that the regularity of the ring is necessary and sufficient for the set 
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of principal right ideals to form a complemented partially ordered set; and 
finally, that in this case the set of principal right ideals is a complemented 
modular lattice (so that principal = finitely generated). Of course in all of 
this “right” can be replaced by “left”. The lattice will be irreducible if and 
only if R is indecomposable in the sense that R does not admit any de- 
composition of the form R =R, @ R, where k;40 are two-sided ideals; and 
this is equivalent to requiring that the center of R is a field. Finally, if a ring 
R has the property that its principal right (or left) ideals satisfy the as- 
cending and descending chain conditions, then £R is regular if and only if it is 
semisimple, thus tying in the classical coordinatization theory with the 
Wedderburn theory. 

It was also established by von Neumann that dualities of Y correspond 
naturally to anti-automorphisms of R, and that orthocomplementations 
correspond to involutive anti-automorphisms x->x* which are definite in 
the sense that 2*z=0<+2z=0. 

Rings of bounded operators in a complex Hilbert space are regular only 
when they are finite dimensional; however, the ring of not necessarily 
bounded operators affiliated (in the sense of Murray and von Neumann) to 
a type II, factor is a regular ring, and is in fact the regular ring whose 
principal right ideals form the lattice isomorphic to the lattice of projections 
of the factor. 

Occupying a special place among the complemented modular lattices are 
the continuous geometries also discovered by von Neumann. He charac- 
terized completely the associated regular rings. The lattices of projections 
of type II, factors are orthocomplemented continuous geometries. Roughly 
speaking the axioms for an irreducible complemented modular lattice to be 
a continuous geometry require the completeness of the lattice and the 
continuity of the lattice operations, and lead to the existence of a dimension 
function with values in the unit interval [0,1]. 

Let D be a division ring, V, and V», (m an integer >2) be left vector 
spaces of dimensions k and mk, respectively, over D. We then have a map 
M->Me@®...@M (m summands) of L(V, D) into L(V mD). Using a 
succession of such mappings (with changing “ratios” m), von Neumann 
constructed continuous geometries associated with D which, when D =C, 
were not isomorphic to the projection lattices of type II, factors. 

These scattered remarks should convey a glimpse of the beautiful 
generalization of classical projective geometry that von Neumann erected. 
The reader who wants to get a sharper perspective on these things should 
begin with a study of von Neumann’s book (Continuous Geometry, Princeton 
University Press, Princeton, New Jersey, 1960, which has annotations by 
I. Halperin that illuminate many aspects of the theory and contain refer- 
ences to further literature), and his papers on the subject that appear in 
volume IV of his Collected Works (6 volumes, Pergamon, 1961). 


CHAPTER III 


THE LOGIC OF A QUANTUM MECHANICAL 
SYSTEM 


1. LOGICS 


Let © be any quantum mechanical system. We have seen that one can 
associate with © the partially ordered set #(G) of all experimentally 
verifiable propositions concerning ©. The partial ordering is that induced 
by the implication relation. Moreover, the map, which associates with 
any element of (©) its negation, behaves very much like an ortho- 
complementation. This leads one to introduce axiomatically a class of 
partially ordered sets and to study their properties. We call these systems 
logics. The basic assumption of modern quantum theory can then be 
described (if we anticipate some terminology) by saying that (G) is a 
standard logic. 

Let £ be a lattice under a partial ordering <. By an orthocomplementa- 
tion of Z we mean a mapping 

(|):a—-at 
of Z into itself such that 


(i) _| is one-one and maps # onto itself, 
(ii) a < b implies bt < a’, 
(1) (iii) at =a for alla, 
(iv) a Aat+=0 for alla, 
(v) avat=1 forala. 
We note that the relations 0'=1 and 1+=0 are easy consequences of 
(ii). Moreover it follows easily from (iii) that | is one-one and onto, so 
that (iii) > (i). Finally, we observe that (ii), (iii), and (iv) imply (v). In 
fact, let ae Y and b=ava’. Since both a and a’ are <b and aH =q, 
(ii) implies that bt <a and bt <a’. Hence b'<aAat=0 or b=b=1. A 
lattice Y, with an orthocomplementation | (a—da*), is said to be a 
logic if 
(i) for any countably infinite sequence a,, a2,--- of elements of Z, 
Va an and Nn Qn exist in F, 
(ii) if a@,,a,€f and a; <a there exists an element be Z such 
that b <a,+ and b V a; =đ2. 
42 
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Before proceeding further we make the observation that if a,<a., the 
element b whose existence is postulated in (ii) of (2) is unique and is in 
fact equal to a,+ Adz. To see this, let us write 6, =at Aag. Since b<a,+ 
and b<a we have b<b,. Therefore, using (ii) of (2) again, we find a 
deL such that d<b+ and dvb=b,. Since d<bt, b<d+. Further, 
d<b,<a,* so that a,<d+. Therefore, a, vb=a,<d* which implies that 
d<a,*. Moreover, d<b,<a, also. Consequently, d<a,Aa,+=0 which 
means that b,=bvd=b. 

The element b=a,1Aa, of (ii) of (2) will often be written as a,—4a. 
For any a e Y, we call at the orthogonal complement of a in L. If a<b+, 
then b <a"; we shall then say that a and b are orthogonal and write a | b. 
Suppose a, be Y are two elements such that a <b, ab. As in Chapter II, 
we write 


(3) La, b] ={c:ceL, a<c< Bd}. 


Then under the partial ordering inherited from Z, ¥{[0,b] becomes a 
lattice, in which countable unions and intersections exist and whose zero 
element is 0 and unit element is b. If we define, for any x in ¥{[0,0], its 
orthogonal complement zx’ by x’=2z'Ab, then it can be shown that 
£[0,b] equipped with the orthocomplementation x — x’ is a logic (cf. 
corollary 3.3). 

The central assumption that one makes in any quantum mechanical 
application is that the set of experimentally verifiable propositions is a logic. 
The only thing that may be open to serious question in this is assumption 
(i) of (2) which forces any two elements of ¥ to have a lattice sum; the 
others may be regarded as technical necessities. We can offer no really 
convincing phenomenological argument to support this (cf. Birkhoff-von 
Neumann [1]). If we omit the assumption that ¥ is a lattice the axioms 
become so weak that it is very difficult to avoid pathology in any mathe- 
matical discussion. We point out also that every calculation in quantum 
mechanics is based on assumptions which not only imply that the set of 
experimentally verifiable propositions is a logic but in fact a very special 
one. 

We shall now make a few general remarks. If Z, and Z, are two logics, 
an injection of L into L, is a map f(a > f(a)) of L into L, such that 
(i) f is one-one, f(0)=0, f(1)=1; (ii) if a}, ag, --- is any at most countable 
sequence of elements of Z, f(Vnda)= Val (an) and f(Aa@,)=Af(@,); 
(iii) f(a*)=f(a)+ for all ae L. An isomorphism of L, on L, is an in- 
jection which maps Z, onto La. If L =Z, isomorphisms are called 
automorphisms. The set of all automorphisms of a logic # is a group 
under composition; we denote it by Aut(#). If S, and # are two logics, 
we say that Z is a sublogic of # if (i) S S&F and (ii) the identity map 
of Z, into Y is an injection. 
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Any Boolean o-algebra is a logic provided we define, for any element a, 
a+ to be the complement of a. These logics are of course not very interesting 
from the point of view of quantum mechanics. To obtain a more typical 
example, let us consider a Hilbert space # over the real, complex or 
quaternionic division rings. We denote by ¥(#) the collection of all 
closed linear manifolds of #. If we now define < to mean set inclusion, 
and | to mean the usual operation of orthogonal complementation in 
LH), then it can easily be verified that Z(H) is a logic. The isomorphism 
class of Z(H) depends only on the field D of definition of # and the 
dimension of #. A logic Z is said to be standard if it is isomorphic to 
the logic Y (H) of a separable infinite dimensional Hilbert space over one 
of the three division rings R, C, or H. Modern quantum theory works with 
the assumption that the logic of any atomic system is, if not standard, at 
least a sublogic of a standard logic. We shall make a deep study of standard 
logics in Chapter 1V. We shall also describe there a number of logics that 
are sublogics of standard logics, but are not standard themselves. 

We shall now proceed to derive a few simple consequences of the 
axioms defining a logic. 


Lemma 3.1. Let Z be a logic and a,,a,,...a sequence of elements of L. 
Ifbe Landb a, for all n, thenb|\/, an. Moreover we have the identities: 


(y an)" = /\ Ant, 
(Aat = Yat 


Finally, if adı < az, C <a, and c | a, then c< az — a. 


(4) 


Proof. Since b<a,*, a,<b+ for all n. Hence Vp a,<b+ or b< (Vn a,)+. 
This proves that b | Vna, We now prove the first equation of (4). For 
any M, Am < Vn an, So that (Vn an) <amt. Consequently, (Vn an)t< N m amt. 
On the other hand, Nm am <4a,* for any n and hence a,<(/N m aämt)t. 
This leads to the relation Vn an < (/\m amt) or (Nm amt) < (Vn an)t. This 
proves the first relation in (4). The second is obtained from the first by 
replacing a, by a,* for all n and taking the orthogonal complements of the 
relation obtained. Finally, let a} <a, c<a., and ca. Write a,=a, Vc. 
3 <a and hence there is a de Z such that d| a and a, V d=a;. Since 
dja, and cla, a,|(cVvd). Moreover, a,Vv(cvd)=a,. Hence cvd 
= dt ^ao, which proves that c<a,—a,. 


Lemma 3.2. Let a, b, c be three elements of a logic such that ab and b <c. 
Then 


(5) avb)Ac=(aadnc)v 6. 


Proof. Write d=(aAc)Vb and e=(avb)Ac. It is obvious that d<e. 
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Hence, by (ii) of (2), there exists an element g of Z such that d_|g, and 
dvg=e. Since a| b and the element g satisfies the relations g<avb, 
gb, it follows from the last statement of lemma 3.1 that g <a. Hence, as 
g<e<c, g<aAc<d. However, since g|d, g<d+ also. Therefore g=0. 
This proves that e=d V g=d. 


Corollary 3.3. Let b be a nonzero element of L. Then L[0,b] equipped with 
the orthocomplementation x—>x' defined above is a logic. 


Proof. We must verify (ii), (iii), (iv) of (1) and (ii) of (2). (ii), (iv) of (1) 
are obvious. We shall now prove (iii) of (1). Let <b. Since 2’ | x and 
x’ | b+, one has 2’ | xv b+, while 2’ v x v b+ =1. Thus, we may conclude that 
x't=xVbt. But then 2"=27'!ajb=(rvb')Ab=a2 from (5). We come 
finally to (ii) of (2). Letr<y<b. As Ẹ is a logic, there is a z e Z such that 
z|2 and zvx=y. Since z<b, z=zAb so that z<z’. This proves (2) in 
£(0,5]. 


Remark. It might be observed that (5) is a special case of the usual 
modular law. We may therefore regard any logic weakly modular since 
the modular law (5) is valid, not for all a and b, but only for a, b with 
a| b. The standard logics are the only ones which have had any significant 
measure of success as a framework for the precise formulation and solution 
of quantum mechanical problems. Since these are not modular, it is too 
restrictive to demand in the definition of a logic that it be modular. This 
explains why we have rejected modularity as a property of a logic and 
also why at the same time we have inserted the very much weaker con- 
dition (ii) of (2) which leads to (5). On the other hand, if a logic is modular, 
and separable in a natural sense, then it follows from a general result of 
Kaplansky [1] that it is a continuous geometry in the sense of von Neumann. 
Whether the logic of any atomic system may be assumed to be an ortho- 
complemented continuous geometry is still somewhat of an open question. 
The point is that, unlike a standard logic, a continuous geometry does 
not have points; this circumstance makes the description of the states of 
the logic difficult. 


2. OBSERVABLES 


We shall introduce in this section the concept of an observable associ- 
ated with a logic. The formal definition is given in the next paragraph, 
but we shall first try to motivate the definition. Suppose that G is a 
physical system, ¥(G) its logic, and £é a physical quantity or observable. 
Then in any experiment which an observer performs, the statements that 
can be made concerning € are of the type which asserts that the value of 
€ lies in some set E of real numbers. It is natural and harmless to require 
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that the sets E be Borel. If we denote by z(H) the statement that the 
value of € lies in the Borel set Hc R+, then one has a mapping 


x: E —x(E) 


of Z( R?) into L(G). We shall regard two observables as “‘identical”’ if 
and only if the corresponding mappings are the same. If f is a real valued 
Borel function of a real variable, then we mean by fo & the observable 
whose value is f(r) whenever ¢ takes the value r; to this observable 
clearly corresponds the mapping E —> x(f-1(E)) (cf. Chapter I, sections 2 
and 4). 

Motivated by these remarks we introduce the following definitions. Let 
[F be a logic. An observable associated with L is a mapping 


x: E —>x(E) 
of the o-algebra Z( R+) of Borel sets of the real line into Z such that 


(i) 2(@) = 0, x(R) = 1, 
(6) (ii) if E, FeZ2(R!) and ENF = ø, (E) | x(F), 
(iii) if E, Ez, +- - is a sequence of mutually disjoint Borel sets in Rt, 


z( En) = V a(E,). 


We write 0 or O(£) for the set of all observables associated with &. We 
remark that the properties (i) through (iii) are natural if we want to 
interpret x(E) as the statement that the value of the observable € lies in Æ. 

If X is any set and is an arbitrary o-algebra of subsets of X, then Z 
is obviously a logic provided we define < to be set inclusion and | to be 
set complementation. Theorem 1.4 asserts that the set of observables is 
in canonical one-one correspondence with the algebra of all real valued 
£-measurable functions on X; if x(E — x(E)) is any observable, then the 
corresponding function f on X is the unique -measurable one such that 
f-\(£)=2(£) for all E e BR). 

Suppose that x is an observable and f a real valued Borel function of a 
real variable. Then 


(7) fou: H>2f-*(#)) 
is also an observable. 


Lemma 3.4. Let L be a logic and x an observable associated with L. Then 
for any sequence E, Ez,- of Borel sets 


«(UU En) = V (En), 


(8) x((\ En) = A x(H,,). 
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Suppose further that fı and fz are two real valued Borel functions of a real 
variable t. Let f, o fa denote the function t —> f,(fo(t)). Then 


(9) (Ji 0 f2) ° £ =f, o(fgo2). 


Proof. Since 2(H,) | 2(R'—#,) and z(E,)Væ(Rt—-E,)=1, it follows 
that «(R1—EH,)=2(E,)+. Consequently, in view of the equation (4), the 
second equation in (8) will follow if we prove the first. To prove the first, 
we begin by observing that if A € B, then z(A) <2x(B); in fact, 


2(B) = x(A) v x( B- 4A). 


This said, let H=\|_), H,. Then z(E,) <x(£) for all n so that V, 2(#,) <2(£). 
On the other hand, let F, =E, and for n>1, let F,=#,—\Jj<, H;. The 
F, are then mutually disjoint, F, SE, and \_), F,=H. Therefore, 


a(H) = V x( Fa) < V x(E,). 


This completes the proof of (8). Equation (9) is trivial. 

Let x be an observable associated with Z. A real number À is said to be 
a strict value of x if z({A}) 40. x is said to be discrete if there exists a count- 
able set C={c,, cg,---} of real numbers such that x(C)=1. x is said to be 
constant if there exists a real number a such that x({a})=1; we shall 
then write x=a. x is said to be bounded if there exists a compact set K 
such that «(K)=1. If we define o(z) by 
(10) a(x) = 

- C closed, x(C)= 1 

then o(x) is a closed set called the spectrum of x. Since the topology of R! 
satisfies the second countability axiom, there exists a seguence of closed 
sets C,,C2,--- such that z(C,)=1 for all n and o(z)=(),C,. Since 
x(k! —C,)=0 for all n, 


which proves that x(o(z)) =1. o(x) is thus the smallest closed set C such that 
2(C)=1. The numbers A € o(x) are called the spectral values of x. A strict 
value is a spectral value but the converse is not true in general. x is 
bounded if and only if o(x) is a compact set. A € o(x) if and only if for any 
open set U containing À, 2(U) #40. 

Since the defining axioms of a logic are somewhat weak, it is not reason- 
able to hope for any incisive description of the set of all observables 
associated with Z. In the next chapter we shall examine this question 
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for logics which are associated with Hilbert spaces. For the moment we 
confine ourselves to a remark on discrete observables. Suppose # is an 
arbitrary logic. Let x be a discrete observable and {c,, co, -- -} the set of its 
strict values. Then 2({c,}) and x({c;}) are orthogonal whenever 147 and 


V x({c}) = 1. 


i 


Conversely, let {c;},-p be an at most countably infinite set of distinct real 
numbers and {a,};-p a family of mutually orthogonal elements of Z 
(with the same indexing set D) such that 

Va =1. 

ieD 
Then there exists a unique discrete observable x associated with Z such 
that x({c,}) =a, for alli e D. In fact, we have only to define, for E e@( FR), 

x(E) = ae l. 

It is easy to verify that x is an observable. It is clear that x({c,}) =a, for 
allie D and that x is discrete. The uniqueness of x is obvious. 

Even though many interesting and important observables are discrete, 
there are quantum observables which are not discrete. The construction 
of observables which are not discrete is, however, quite complicated. We 
shall indicate how it can be done in the next chapter for logics which are 
associated with Hilbert spaces. If & is an orthocomplemented geometry 
of finite dimension J, then it is easy to see that every observable associated 
with Z is discrete and has at most N values. Clearly, such logics are not 
suitable models to represent the experimentally verifiable propositions of 
complicated atomic systems. 


3. STATES 


If S is a classical mechanical system, the concept of a state of © is so 
defined (cf. Chapter I) that if © is in a state p, and £ is any observable, 
then ¿ has a value in p. This leads one to postulate that the states are 
points of the phase space and observables are real valued functions on it. 

In quantum theory this has been rejected as irreconcilable with the 
available experimental facts regarding the behavior of atomic systems 
when subjected to refined microscopic observations (cf. von Neumann 
[1], Heisenberg [1]). The modern approach to atomic physics is based on 
the principle that in a given state of the system, an observable has only a 
probability distribution of values and in general no sharply defined value; 
and no matter how carefully the state 1s prepared, there will be some observ- 
ables whose values are distributed according to probability distributions 
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having arbitrarily high variance. Accordingly, if © is in a state ø, one can 
associate, with each observable x, a probability distribution, say P,’, on 
the line, which will be interpreted as the probability distribution of the 
values of x when the system is in the state o. If x is a discrete observable, and 
C1, C2, * * © are its possible values, then P,’ would have mass concentrated 
on the set {c,, C2, * ++}; the probability mass which is then assigned by 
P,’ to the point c, will be the probability that an experiment on ©, 
designed to yield an exact value of x, gives the value c,, when © is in the 
state o. If f is any real valued function of a real variable t, then, for the 
observable f o x, we would have the equation: 


Prob{f o x = a, when © is in the state o} 
(11) = > Peda). 


nif(e,)=a 
With these remarks serving as our motivation we proceed to the formal 
definitions. Let Y be any logic and @ the set of all observables of Z. A 
state function of # is a map 


(12) P:z-P, (xe) 


which assigns to each observable x e0 a probability distribution P, on 
@(h*) such that for any real valued Borel function f on R! and any 
observable x, 


(13) Pyox(#) = P,(f~*(#)). 


Equation (13) has an interesting consequence. We claim first that if 0 
is the zero observable, then P, is the probability measure whose entire 
mass is concentrated at the origin. In fact, if g is the function identically 
zero, then g o 0=0; since g~1(H#)= R! or @ according as 0 e Æ or 0 ¢ E, we 
have, by (13), Po(£) = Po(g~1(£) =1 or 0 according as 0 e E or 0 ¢ E. We 
next observe that if x is any observable and Æ is a Borel set such that 
2(#)=0, then P,(H#)=0. In fact, if z(#)=0, then, for the function f whose 
value is 1 on # and 0 on R?— FẸ, it is obvious that y=f o x is the zero 
observable and hence P, is the measure concentrated at the origin. As 


H={t: f(t)=1}, we have P,(£) = P,({1})=0. In other words, 
(14) P(E) = 0 (He BR), 2(H#) = 0). 


(14) implies that if x is discrete, the mass of P, is concentrated on the 
set of strict values of z. . 

In turns out to be possible to construct all the state functions of Z 
in a particularly simple manner. In order to describe this construction 
we introduce, with Mackey [1], the notion of a question. An observable x 
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is said to be a question if x({0,1})=1; x is then discrete. If a=x({1}), it is 
clear that v is the only question such that 2({1})=a. We shall call it the 
question associated with a and denote it by q4. 

Suppose now that x is an observable associated with the logic # and 
E any Borel set S R. If yp is the function which takes the value 1 on Æ 
and 0 on R?— E, then yç ° x is the question associated with x(£), i.e., 


(15) XEL T = Axer 


In fact, since yg takes only the values 0 and 1, it is clear that y, o x is 
discrete and (xg ° x)({0,1})=1. xg ox is therefore a question, and, as Æ is 
the set where yç takes the value 1, 


(xz ° x)({1}) = 2(#). 
The description of all state functions can now be given in terms of the 
concept of a measure on Z. A measure on Ẹ is a function 
(16) p : a — pla) (ae L) 
such that 


(i) pis real valued and 0 < p(a) < 1 for alae Ẹ, 


p(0) = 0, p(l) = 1, 
(ili) if @,,@g,-++ is a sequence of mutually orthogonal elements of 
£ and a= Vn n then p(a)= >, p(a,). 


Notice that if a,,a,¢€ Z and a,<az,, then there exists, by (ii) of (2), 
be £ such that a, |b and a,=a,Vvb. Therefore, p(a,)+p(b)=p(a,). In 
particular, 


(18) phai) < plaz) ` (a, < a2). 


The concept of a measure on Y is a generalization of the well known 
concept when # is a Boolean o-algebra. However Z need not be distrib- 
utive and hence the structure of a measure on # is much more complex 
than that of a measure on a Boolean o-algebra. 


Theorem 3.5. Let Z be a logic and O the set of all observables associated 
with Z. Let p be a measure on L. If we define, for any observable x e O and 
any Borel set ES Rt, 


(19) P.” (E) = p(x(E)), 
then P, is a probability measure on B( R!) and 
(20) P’ : x —> P, 


is a state function of L. Conversely, if P(x —> P,) is an arbitrary state 
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function of L, there exists one and only one measure p on L such that 
P.(E)=p(x(E)) for all x. 


Proof. Suppose that p is a measure on Z. If E, Ez,- -- are mutually 
disjoint Borel sets of R1 with union Æ, then 2(Z,), 2(#,),--- are mutually 
orthogonal elements of # with x(E)= V, x(£,); moreover, 


P(E) = > P,*(E,). 
Since P,?(R!)=p(1)=1, we see that P,’ is a probability measure on 
BR). If f is a real valued Borel function of a real variable, then 


Přoz(E) = p((f ° x)(#)) 
= p(2(f~*(#))) 
= Pi(f-*(#)). 


This proves that x — P, is a state function of 2. 

Conversely, let P(x —> P,) be a state function of Z. Face P, P, isa 
probability distribution on the real line and as g, is a question, (14) 
ensures that P, ({0,1})=1. Define 


(21) pla) = Pa (i). 


pla — p(a)) is clearly a well-defined, real valued function on # and 
0<p(a)<1 for all a. We shall now prove that p is a measure on #. Since 
qo is the question with q,({1})=0, it follows from (14) that P,,({1})=0. 
This proves that »(0)=0. Similarly, we prove that p(1)=1. Let {a,},.1 
be a sequence of mutually orthogonal elements of Z and let a=\/, a,. 
Let x be a discrete observable such that 2({0})=a! and z({n})=a,, 
n=1,2,---. Let f, be the function on R! whose value is 1 for the point n 
and 0 otherwise. Then f, o x is the question g,, and we have, by (21) and 
(13), 


Plan) = Pz({n}). 
Since P, is a probability measure, it follows that 
>, Pla) = Pz(Z), 
n=1 


where Z is the set {1, 2,---}. If f denotes the function whose value is 1 on 
Z and 0 on R!— Z, f o x is the question q, and hence, exactly as before, 
p(a)=P,(Z), so that 


(22) pla) = > pla). 
n=l 
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p is thus a measure on Ñ. If x is any observable and E e Z(R1), we have, 
on writing g for the function whose value is 1 on E and 0 on R!—E, 


P(E) = Py ox({1}) 
= Panel) 
= p(x(£)). 


Finally, equation (21) implies easily that p is unique. This completes the - 
proof of the theorem. 


In view of the theorem we proved just now, it is natural to define a 
state of a logic to be a measure on Z. The set of all states of Z will be 
denoted by S. If p(a— p(a)) is a state of Z and z(E —>-2(E)) any 
observable, E —> p(x(H#)) is a probability measure on (R+). It is called 
the probability distribution of x in the state p and is denoted by P,?. 

Using the probability distribution P,?, we may define the concepts of 
moments of an observable. The (mean) expected value &(x|p) of an observ- 
able x in a state p is defined by 


(23) (z| p) = f tdP,?(t). 
The variance of x in the state p is defined when &(x?| p) < 00; in this case, 
(24) var(z|p) = &(x?| p) -[E(2| p)}?. 


Clearly var(z|p)>0 and is =0 if and only if P,” is concentrated at the 
single point tp =&(z| p), and then it is natural to say that, in the state p, x 
has a sharply defined value, namely to. 


4, PURE STATES. SUPERPOSITION PRINCIPLE 


Let Z be a logic and S the set of all states of Z. If pi, po,--- is a 


sequence of elements of / and c;, Cz, ++- is a sequence of constants such 
that c,>0 for n=1, 2,--- and c,+c,+---=1, then the function 
p(a —> p(a)) defined by 
(25) pla) = > cnp,la) 

n21 


is easily verified to be a state of Z. We shall write 


(26) p= CaPn- 
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F is therefore a convex set. If x is any observable, it follows at once from 
(26) that 


(27) P= > ¢,P,?. 
n21 

The equation (27) may be given the following interpretation. Suppose 
that one knows that the state of a system is one of p4, P2, +- with prob- 
abilities c4, ¢2,---, respectively. Then, for any bounded observable x, the 
expectation value of x is 5,21 ¢,6(x|p,) which is equal to &(2|p). In 
other words, assuming that the system is in the state p is, statistically 
speaking, equivalent to assuming that its state is one of p4, po,--- with 
respective probabilities c}, c.,---. It is customary to say that the state 
p is a classical superposition of the states p,(n> 1). 

In view of the probability interpretation of (26) and (27), interest 
centers around states for which no representation of the form (26) is 
possible. A state p is said to be pure if the equation 


(28) p=cp,t+(l—c)p, (Osesl1, pi, poe) 


implies that p=p,=p.. We write F for the set of all pure states of 2. 
P is the set of extreme points of the convex set S. 

We may now introduce the notion of superposition. Let 2 be a set of 
states and po an arbitrary state. We say that Po is a superposition of 
states in 2 if the following property is satisfied: 


(29) aceY, pia)=0 forall pe2 = pla) = 0. 

If p=c,p,+CgPot+-:: where cy, C2: + 20, andc,+co+---=1, then p is a 
superposition of the states in the set {p,, P2,- --}. As we shall presently see, 
if Y is a Boolean o-algebra, this is, roughly speaking, the only kind of 
superposition possible, in particular, no pure state can be a superposition 
of other pure states distinct from it. This is in contradistinction to quantum 
mechanics, where the structure of Z, namely the fact that it is standard, 
forces the concept of superposition to be nontrivial even among pure states. 


Theorem 3.6. Let Z be a Boolean o-algebra of subsets of a space X such 
that (i) L is separable (cf. Chapter I) and (ii) {a} € L for alla e X. For any 
a e xX let 8, be the state defined by 


30 sep [1 Y 084 
(30) ad if a¢A 


Then {8, : a e X} is precisely the set of all pure states of L. If Dis any set 


of pure states, and p, an arbitrary pure state, po 18 a superposition of states 
of D if and only if pE 2. 
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Proof. Let {A,, Ao, ---} generate Z. That 5, is pure is trivially verified. 
Suppose that p is a pure state. If for some A, E Z, 0< p(A,) <1, then, on 


putting 

P(A) = (p(Ao))“*p(A A Ad), 

P(A) = (1—p(Ao))~*p(A A (X—A))), 
we would obtain the decomposition 


p = p(Ao)-pi+(l—p(Ao))- P2; 
Pi F Po a8 p,(Ay)=1 and p.(A,)=0. This is not possible as p is pure by 
assumption. Therefore we may conclude for any A e Y, that p(A)=0 or 1. 
By replacing A, by X —A, if necessary, we may assume that p(A,)=1 for 
all n, where {A,, Ag,---} generates Z. Let 


B=()A,. 


Then p(B)=1. In particular, B is nonempty. We assert that B cannot 
contain more than a single point. In fact, the collection of all sets Ce L 
with the property that either BCC or B A C= ø, is a o-algebra which 
contains all the A,. Hence it coincides with Y. Therefore, as {xz} e F, 
this is possible only if B={a} for some a e X. But then p= ô,. Finally, let 
Po be a superposition of the states of 2 where each element of 2 is pure. 
If po=6,,, but po E 2, then p({ao})=0 for all pe 2 but po({a.})=1; a 
contradiction. The theorem is completely proved. 

Let Z be an arbitrary logic and F the set of all pure states of Z. For 
any subset SY, we write S for the set of all p e A such that p is a super- 
position of the states of S. Let us now consider Æ, the collection of all 
SSZ with the property that S=S. Under set inclusion, M becomes a 
partially ordered set. If Y satisfies the assumptions of theorem 3.6 for 
example, then M is the class of all subsets of and is therefore a Boolean 
algebra. However, in the case of more complex L, M is not a Boolean 
algebra. In general, the geometric structure of Æ seems somewhat hard 
to determine. For the so-called standard logics to be introduced in the next 
chapter, this can be done and it becomes possible to determine the 
structure of “ completely. For standard L, M then becomes isomorphic 
to Z. It is this fact which confers on the set of pure states of a standard 
logic a geometric structure characteristic of quantum theory. In the 
physical literature (cf. Dirac [1]) this is elevated to a physical principle, 
the so-called superposition principle. From our point of view, the geometry 
of pure states will appear as a consequence of the structure of the logic L. 


5. SIMULTANEOUS OBSERVABILITY 


One of the features which sets apart quantum mechanics from classical 
mechanics is the existence of observables whose values cannot always be 
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simultaneously measured. We shall now introduce this idea formally. We 
begin with the elements of Z themselves. Let a, b e Z. We shall say that 
a and b are simultaneously verifiable, a — b in symbols, if there are elements 
a, 6,, and c e £ such that 


(i) ai, bı, and c are mutually orthogonal, 


(31) (11) Qa=a, V C, b = b, Ve. 


If x and y are two observables, they are said to be simultaneously observable 
if for any two Borel subsets E and F of Rt, z(E) <> y(F); we shall write 
x> y. If A and B are two subsets of Z, we write A +> B when a +> b for 
all a e A and be B. 

The analysis of these concepts depends on a study of Boolean sub- 
algebras of Z. A subset £ € is a Boolean subalgebra of £ if it is (a) a 
sublogic of # and (b) a Boolean algebra, i.e., if 


(i) 0, leg, 

(ii) ifa, be £, thena v b,a A b, and ate <£, 

(iii) if a, b, c e £, then . 
an(6vece)=(aab)v (aac), 
aVvV(bAc)=(av b)A (ave). 


SS is said to be a Boolean sub-o-algebra of £ if it is a Boolean subalge- 
bra, and if, for any countable sequence a}, ag, EL, \/,a,6€ L and 
Nnn EL. 


Lemma 3.7. Let a, be L, L being any logic. Then the following statements 
are equivalent: 


(a) a «>b, 

(b) a—(a ^ b) Lb, 

(c) b—(a A b) La, 

(d) there exists an observable x and two Borel sets A and B of the real 
line such that x(A)=a and x(B)=b, 

(e) there exists a Boolean subalgebra of L containing a and b. 


(32) 


Proof. Suppose a «>b. We may then write a=a, vc, b=b, Vc, where 
a,, bı, and c are mutually orthogonal. Clearly c<aAb. Hence there is a 
d such that c|d and cvd=aJAb. Since dj c and d<a, d<a—c=a, 
(lemma 6.1). Similarly, d<b,. Hence d<a,A6,=0. Therefore c=a Ab. 
Hence a,=a—(aAb). But a, |c and a, Lb, so that 


a,=a-(aAb)1Lb,vc=b. 


This proves that (a) > (b). By symmetry, (a) = (c). If a— (a Ab) Lb, then, 
on writing a,=a—(aAb), b,=b—(aAb) and c=aAb, we find that 
a=a,Vc and b=b, vc. Since a, | b, we have a, |b, and a, |c, while, by 
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definition, b, | c. Therefore a «> b, proving that (b) = (a). (a), (b) and (c) 
have thus been proved equivalent. We shall complete the proof by showing 
(a) => (d) => (e) > (a). If a=a, vc, b=b, Vc, where a,, bj, and c are 
mutually orthogonal, we write d =a, V b, Vc and define x to be the discrete 
observable such that 2z({0})=a,, æ({1})=b,, x({2})=c and z({3}) =d+. 
Then 2({0,2})=a, x({1,2})=b. This proves that (a) = (d). Suppose that 
(d) is satisfied. Then 2(A—AN B)=a—aAQb, 2(B—AN B)=b—anb. 
Writing a, =a —a A b, ag=aAb", ag=b—anb, a,=(avb)*, we see that the 
a, are mutually orthogonal and a, V a, V a3 V a,=1. If is the collection 
of elements which are the lattice sums 


a4 Va Ve VG, (KS 4,154, Sisi < 4), 


1 


then it is easy to verify that & is a Boolean subalgebra of Z. Since 
a,b e x, we see that (d) = (e). Finally, let (e) be satisfied, and let Z be 
a Boolean subalgebra of Z containing a and b. Clearly [a — (a A b)] Ab=0. 
But, as a— (a ^ b), b and b+ are in the Boolean algebra £, 


a—(a A b) = {(a—(a A b)) A b} v Y(a—(a A b) A B+} 
= {a—(aAb)} A bt 
< bt, 


showing that a— (a Ab) Lb. This shows that (e) > (b) and hence that 
(e) = (a). 


Corollary 3.8. If a «> b, the elements a,, b,, and c of (31) are uniquely 
determined. In fact, c=a ^b, a, =a—a ^b, and b, =b—a Ab. 


Lemma 3.7 shows that in the presence of simultaneous verifiability of 
two elements of #, one can operate with the statements corresponding to 
these elements as if they were classical. We shall now take up the question 
of generalizing this fact to observables. Our aim is to prove that stmul- 
taneous observability is essentially a characteristic of classical systems. We 
shall deduce, as a corollary, that, if x, y, z,--- are observables, associated 
with a logic, they are mutually simultaneously observable if and only if 
they are all functions of a single observable. This was first proved by von 
Neumann (cf. [1]) when the logic in question was a standard one. For a 
general logic, it was proved by Varadarajan [1]. 


Theorem 3.9. t Let Z be any logic and {x)},en a family of observables. 
Suppose that xı 4> x, for all à, X e D. Then, there exists a space X, a 


t The definition of a logic given in Varadarajan [1] does not require it to be a 
lattice. It was pointed out to us by Dr. S. P. Gudder [1] that theorem 3.9 does not 
remain valid for these more general classes of partially ordered sets considered in 
our paper [1]. 
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a-algebra Z of subsets of X, real valued B-measurable functions g, on 
X(à e D), and a o-homomorphism r of B into L such that 
(33) t(9,~ (E)) = &,(4) 


for all Ac D and E e &R'). Suppose further that either L is separable in 
the sense that every Boolean sub-c-algebra of L is separable, or that D is 
countable. Then there exists an observable x and real valued Borel functions 
fa of a real variable such that for all Xe D, 


(34) La = faoa. 
The proof of this theorem (cf. Varadarajan [1]) depends on a series of 
lemmas. Y is a fixed logic in these lemmas. 
Lemma 3.10. Let b, a,, Qo, -> be elements of L. If b <a, for all n, then 
b 4> apt, 


(35) b 4> V a,, 
b 4> N ay. 
Moreover, we then have the distributivity laws 


bA (V a) = V (b A a,), 


n 


bv (A an) = A (0 V a). 


Proof. We use the criteria of (32). It follows from the equivalence of 
(a) and (d) of (32) that if c <> d, then any two of the four elements c, c+, 
d, d+ are simultaneously verifiable. This proves the first of the three 
relations of (35). Now, we come to the proof of the rest of (35). Write 
a=\/ mm. Since bAa,g<bAa, b—(bAa)<b—(bAa,) for all m. As 
b <> am for all m, it follows from (c) of (32) that 


(36) 


b—(b A a) < b—(b A am) < amt. 


Hence b— (b Aa) < Nm amt =a which proves that b 4> a. Changing all the 
a,, into a, we deduce from the relation b <> Vn a, that b <> Nn an. Next 
we come to (36). Once again it is enough to prove only the first of the two 
relations. Clearly \/, (bAa,)<bAa, where a=\/,a,. Let d<baa be 
such that d| V, (bAa,) and (V, (bAa,)) Vd=b Aa. Since d | bAa, and 
ad<b, we deduce from the relation b «> a„ that d<b—(bAa,) (by lemma 
6.1) and hence, as b <> a,, we conclude that d_|a,, from (c) of (32). Since 
this is true for all n, d] a. As d<a, this means d=0, proving the first 
equation of (36). 

Consider the collection M of sublogics of Z, partially ordered by in- 
clusion. If {¥,},.4 is any family of sublogics of &, it is obvious that 
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On Fa is also a sublogic of Z. From this it follows that if {%,},., is any 
family of sublogics of £, there exists a smallest sublogic containing all the 
La We denote it by Vae La. M is thus a complete lattice. If each Z, is a 
Boolean subalgebra of Z, it is however not true in general that Va Za 
is a Boolean subalgebra. We shall now prove that this is so if and only if 
the relations Y, > Fx are satisfied for all «, a’ e A. We begin by con- 
sidering the case when there are only two Boolean subalgebras involved. 


Lemma 3.11. Let Z, and Z, be two Boolean subalgebras of L. Then 
2,3 R, if and only if 2, V B, is a Boolean subalgebra of L. 


Proof. Suppose that 2, V2, is a Boolean subalgebra of #. Then 
Bı 4> 2, in view of (e) of (32). We shall now prove the converse. We shall 
first consider the case when the Boolean subalgebras are finite. Let F, 
(t=1, 2) be finite Boolean subalgebras of Z such that F, 4> F2. Then, 
their Stone spaces (cf. Chapter I) are finite and hence there are elements 
Q,,°°:,a,€F, and b,,---,b,e FA, such that (i) a, lay, 140’, b; Lb, 
jÆ); (ii) a, VagV +++ Va, =b, Vba V +--+ Vb,=1; (ili) F is the collection of 
all lattice sums a,, Va Vee VQ, (lS%,)St2<--+<t,<7) and F, is the 
collection of all lattice sums b; V b,, V-:-V 6, (lL Sji Sja <s 
Ja < 8). Write 


(37) Cy =a, A b; (1 = 1,2,---,7, 4 =1,2,---, 8). 


Obviously c,; | cyp unless 1=0’ and 7=)7’. Since a, <> 0, for all i and J, 
we infer from (36) that 


V Gy = ay 
j=1 
(38) , 
V cy = b; 
i=1 
Consequently, 
r $ 
(39) V Cy = l. 
i=1 j=1 


If F denotes the collection of all lattice sums of the c,,, then F is a 
Boolean subalgebra of # and (38) implies that F S F for 1=1, 2. The 
formulas (37) show that if F’ is any sublogic of Z containing both F, 
and Fa then ACF’. Clearly, therefore, F=F,VFy,. Note that 
FV Fy is also finite. 

Let Z, (t=1, 2) be Boolean subalgebras of Z such that Z, > Z,. Let 
& be defined by 


(40) R= U (F1 V Fa), 


where the union is over all pairs (F ,,F 2) of finite Boolean subalgebras 
of Y such that F SZ, 1=1, 2. We assert that Z is a Boolean subalgebra 
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of Z containing #, and &,. If ac Z, then it is obvious that a+ e 2. 
Clearly, 0, 1 e 2. Suppose now that a, b e Z. Then there are finite Boolean 
subalgebras Fi, F,', Fo, Fo of L such that Fi, F,' are SRi, Fo, Fo 
are C&,, aE F,VFyq, and bE Fi VF. Since F, and F,’ are con- 
tained in the Boolean algebra 2,, F, 4> F’ and hence FV Fi =F,” 
is finite and contained in @,. Similarly, Fa V F? =F” is finite and 
contained in #,. As R, 4> Bo, Fi” 4> F”. Therefore F,” V F” is a 
finite Boolean subalgebra of Z contained in Z. Let us write F =F," V F3". 
As Fi, Fi Fz, and F,’ are contained in F, it follows that F, V F2cF 
and Fi VF SF. Thus, a,be F. Therefore avb and ab are in F. 
This proves that av b and a ^b are in &. 

We shall next establish the distributivity laws. Let a,, a2, a, be three 
elements of Z and (for j=1, 2, 3) let ape FPV FP, where F is a 
finite Boolean subalgebra of # contained in 2, (j=1, 2, 3, k=1, 2). Since 
FHC, for all j, \/3?., F$ is a finite Boolean subalgebra of &,. Let it 
be denoted by F%®. Obviously FP <> F, and hence F=FY y F™ is 
a finite Boolean subalgebra of Z. Since FCF for all j and k, 
FWy FSF for all j so that ae F for all j. As F is a Boolean sub- 
algebra of Z, we have: 


a, A (ag V ag) = (a, A Gg) V (a, A Gz), 
dı V (a3 A a3) = (dı V ag) A (Q, V Gs). 


This proves that Z is a Boolean subalgebra of X. 

We observe finally that #, and @, are contained in &. If a e Z,, we 
have ae F, VF, where F,={0,a,a',1} and F ={0,1}. Hence Z, S2. 
Similarly, Z, €. Z, VZ is thus a Boolean subalgebra of ¥Y. The 


lemma is completely proved. 


Corollary 3.12. If & is defined by (40), then =R, V B. If C is any 
subset of L such that € <> By, k=1, 2, then C 4> F, V Ra 


Proof. If F, is a finite Boolean subalgebra of #, (k=1, 2) and if Z’ is 
a sublogic of Z which contains both #, and Rz, Fı V F€. Hence 
RARER’. This shows that Z, VZ, coincides with #. To prove the second 
assertion, let ce g. Let F,<c&,, be finite Boolean subalgebras of Z. 
From (37) and (35) we may conclude that c «> F v F. Equation (40) 
shows then that c <> 2. This proves that ¢ <> #, V 2a. 


Corollary 3.13. Let Z, (t=1, 2,---, s) be Boolean subalgebras of L such 
that 2,<>B, for all i,7=1, 2,---,s. Then V=, Z, is a Boolean sub- 
algebra of L. If E is any subset of L such that € <> &, for allt =1, 2,---, 8, 
then € <> Vai &,. 


Proof. The assertions are proved by induction on s. For s=2, they 
follow from lemma 3.11 and corollary 3.12. Assume now that they are 
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valid for all s<sọ and let s=sọ+1. Since Z, «> Z, for all i,j <8, +1, 
R=\/i., Z is a Boolean subalgebra of Z and &,, ,, 4> Z by the induc- 
tion hypothesis. Hence, Z., +ı VZ is a Boolean subalgebra of Z. Suppose 
now that c is an element of Z such that c +> Z, i<s 9+1. Then, by the 
induction hypothesis, c+» 2. Hence we conclude from corollary 3.12 that 
c<> \/o*! 2,. This completes the induction and the proof of the corollary. 


Lemma 3.14. Let {2,},.p be a family of Boolean subalgebras of L such 
that Ry «> Ry, for all à, X e D. Then there exists a Boolean subalgebra 2 
of L such that RSZ for all AX D. 


Proof. Let F be a finite subset of D. By corollary 3.13, V er Z, is a 
Boolean subalgebra of Z. Let us write 


(41) AF) = V, Aa 
Obviously, 2(F)cA(F’) if FoF’. Define Z by 
(42) R= U AF). 
FED 
F finite 


Obviously, 2, S2 for all Ae D. We shall now complete the proof by 
showing that 2 is a Boolean subalgebra of #. It is enough to prove that 
any finite subset of Z is contained in a Boolean subalgebra of Z contained 
in &. If a,,ag,---,a, are elements of @ there are finite subsets F., 
F,- +, F, of D such that a; e A(F;). I£ F=); F, then a e &(F) for all 
i. This proves that 2 is a Boolean subalgebra of Z. 


Corollary 3.15. Let {2,},-p be a family of Boolean sub-o-algebras of L 
such that R, 4> R, for all à, à e D. Then, there exists a Boolean sub-o- 
algebra & of L such that 2 EZ for all Xe D. 


Proof. By lemma 3.14 there are Boolean subalgebras Æ’ such that 
A, ER for all A. By Zorn’s lemma we can choose a maximal such, say 2. 
We claim that Ž is a Boolean o-algebra. It is enough to prove that if 
di, Q2, * +- is a sequence of elements of Z, then a=\/,, äm is also in 2. 
Since am E Z, b 4> am for all b e Z. Hence a «> 2 by (35). If £ ={0,a,a+,1} 
then » is a Boolean subalgebra of Z and £ «> Z. Hence WvZ is a 
Boolean subalgebra of # containing 2. Since Z is maximal, 7 VZ =2. 
Therefore a e &. 

For any observable x associated with Z we write 


(43) A(x) = {x(E) : E e BR). 
We shall call (x) the range of z. 


Lemma 3.16. Let Z be a Boolean sub-o-algebra of L. Then, in order that 
AR=R\(x) for some observable x, it is necessary and sufficient that Z be 
separable. 
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Proof. If Z=A(x) and 2={x(E): E an open interval with rational 
end points}, then 2 is a countable subset of # and & is the smallest sub-o- 
algebra of itself containing 2. Z is thus separable. Conversely, let Z be a 
separable Boolean sub-o-algebra of X. By theorem 1.3, there exists a 
space X, a o-algebra Z of subsets of X, and a o-homomorphism A of Z 
onto &. Let {A,, Ag,---} be a sequence of subsets of X such that A, E Z 
for all n and {h(A,) : n=1, 2,---} generates Z. Let Z, be the sub-c- 
algebra of Z generated by {A, : n=1, 2,---}. By theorem 1.6(i), there 
exists a o-homomorphism u of Z( R!) onto Z,. Define x by 


(44) x(#) = h(u(k)) (He &(R?)). 


It is obvious that x is an observable and Z=&(z). 

Now to the proof of theorem 3.9. Suppose that x is an observable and 
2%, =f,0x (AED), where each f, is a real Borel function on R1. Then 
#(x,)SA(x) and hence &(x,) <> R(x) for all A, A e D. This proves that 
2, «+ Xy, for all A, A’ e D. Conversely, suppose that {x,} is a family of 
observables and that xxv, for all A, A’eD. By corollary 3.15, there 
exists a Boolean sub-o-algebra R’ of L such that H(z, ) ER for all A. 
By theorem 1.3, there exists a space X, a o-algebra Z of subsets of X, 
and a o-homomorphism r of Z onto &’. It is now clear that for each 
à E D, there exists, by theorem 1.4, a real valued -measurable function 
g, on X such that, for all E e Z( Rt), 


a,(#) = 7[g,7*(£)] (àe D). 


This proves the equation (33). For proving (34) let Z be the smallest 
sub-o-algebra of Z’ containing all the Z(x,). If Z is separable, so is Z. 
On the other hand, if D is countable, and 2, is a countable set generating 
Alx), Ua 2x is a countable set generating #. Hence & is separable in 
either case. By lemma 3.16 there exists an observable x such that R(x) = 2. 
As R(x) E &(x), we conclude from theorem 1.6(ii) that there exists a real 
Borel function f} of a real variable such that 7,=f, ox. This proves (34) 
and completes the proof of the entire theorem. 


Remark. It is obvious that if the x, satisfy either (33) or (34), 2, z, 
for all A, àe D. Theorem 3.9 therefore tells us that the classical logics are 
precisely those for which any two observables are simultaneously observ- 
able. As soon as ¥ is not a Boolean algebra, it follows that the properties 
of a physical system which are embodied in the logic Y have too complex 
a structure to be exhausted by a single set of simultaneously observable 
quantities. In general, when certain propositions are experimentally 
verified, certain others cannot be so at the same time. 


62 GEOMETRY OF QUANTUM THEORY 


6. FUNCTIONS OF SEVERAL OBSERVABLES 


The characterization given in theorem 3.9 of simultaneous observ- 
ability enables us to construct a calculus of functions of several observables 
which are mutually simultaneously observable. 


Theorem 3.17. Let L be any logic and x1, ,---, x, observables such that 
xı 4> 2; for all i, j. Then there exists one and only one a-homomorphism + 
of BR") into L such that for all E e B(R*) and all 1=1, 2,---, n, 


(45) x(H) = r(a,~*(H)), 


where m, is the projection (ti, tg,++-+,t,) >t, of R” on R'. If g is any real 
valued Borel function on R”, 


(46) go EZE Tartt, Xn) : E — t(g~*(E)) 


is an observable. If 9,, 92, ***, Jy are real valued Borel functions on R" and 
Yi =g; 0 (£1, ***s Ln), then y3,°--, Yy are simultaneously observable, and for 
any real valued Borel function h on R", 


(47) h o (Yis: ts Yk) = (Agiss Gud) © (ttt Ta)» 
where h(g, +, gy) 18 the function 
t = (t, t? t3 ta) —> h(gx(t), .. ty g,(t)). 


Proof. By theorem 3.9, there exists an observable x and real valued 
Borel functions f,,---,f, of a real variable such that 2,=/f, o x for all t. 
Let the Borel map f of R? into R” be defined by 


f : t —> (filt), <- +, falt). 
If we define, for M e Z(R”), 


(48) 7(M) = x(f-*(M)), 


then 7 is a o-homomorphism of Z(R”) into # which satisfies (45). If 7 
and 7’ are two c-homomorphisms of @(R") into Z satisfying (45), then 


whenever M is of the form E, x E,x---xE,, the E, being Borel subsets 
of R+. Since the class of such sets generates @(R"), it follows that r=7’. 
The fact that g o (z,,---,2,) is an observable is trivial. Note that its 
range is contained in the range # of r which is a Boolean sub-o-algebra 
of L. 
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Suppose now that y; =g; ° (£1, ***, Za). Then the range of y; is contained 
in 2 and hence y,,---, y, are simultaneously observable. Let g be the 
map t —> (g,(t),---, 9,(t)) of R” into R*. Then 


y: M —> 7(g7*(2)) 
is a o-homomorphism of 4(k*) into # such that 
ym (Z)) = (9,7 *(#)) = y(E), 

a, being the map (S4, S2, © * +, Sk) > 5, of R* on R? (1=1, 2,---, k). Therefore 
we conclude from the uniqueness of the map r in (45) that, for 

u = ho (Y1, Yost +s Yi)» 

u(E) = y(h-(H)) = (He BR). 
But y(h-1(E))=7(g71(h-1(E))), so that we may conclude that 
ho (Ya, 05+) Yr) = (Agis, Gu)) © (Lay +++ La). 


This proves the theorem. 


Theorem 3.17 enables us to define the joint distributions of z4, ..., £n. 
Let p(a — p(a)) be a state of Z and 2,,---,z, observables such that 
x,<+ x, for all 1, j. We may then clearly define the probability measure 
PÈ,- z, 00 BR") by 


(49) Pri tn(M) = p(r(M)). 


PÈ x, 18 called the joint probability distribution of (x1, --, £) in the 
state p. It follows easily from (47) that for Borel Mc R*, 


(50) Py (M) = P,e (87*(M)), 


V1ie°**s1 Uk 


where g is the map 


t= (ti, e.e, b) |> (9,(t), ar) g;.(t)). 


We see from (50) that the rules for the calculation of the joint probability 
distributions are the standard ones of probability theory. 


7. THE CENTER OF A LOGIC 


We begin by recalling the discussion of Chapter II in which we singled 
out the geometries among the modular lattices of finite rank by demanding 
that their centers be trivial. We shall now do the same for logics. The 
irreducible logics would then have properties very far removed from those 
of Boolean algebras and would consequently serve as plausible models for 
the logics of atomic systems. 
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Let £ be any logic. We define the center of Z to be the set @, where 


(51) €={a:aeL, arb, forallbe 2H}. 


@ is nonempty since 0 and 1 are in @. The elements of @ are thus simul- 
taneously verifiable with every element of X. If x is any observable with 
#(u)S@, then for any observable y, x«>y. Such observables x are 
called central. L is said to be irreducible if 6 = {0,1}. Clearly, # is irreducible 
if and only if the only central observables are the constants. 


Theorem 3.18. Let Z be any logic and @ its center. Then € is a Boolean 
sub-o-algebra of L. 


Proof. The equations (35) and (36) reveal immediately that @ is a 
Boolean subalgebra of #. If a,,---,a,,--+ is a sequence of elements in 
G, then b «> a, for allr(b e Z) and so V, a, <> b by (35). Hence \/, a, E E 
also. This proves that @ is a Boolean sub-o-algebra of 2. 

Since the Boolean o-algebras are relatively familiar, one may ask 
whether there is a decomposition of Z over @ analogous to that for 
modular lattices of finite rank. There is no difficulty in doing this when 
@ is discrete, i.e., when there exists an at most denumerable set {a,}.ep 
of mutually orthogonal elements of @ such that (i) \/,a,=1; (ii) € 
consists precisely of all the lattice sums Vez 4,, where Z is an arbitrary 
subset of D. The a, are called the atoms of €. If Ẹ is not discrete, then 
such decompositions seem much more difficult to obtain. 

On the other hand, let us define to be continuous if, given any ce @ 
which is nonzero, there exists a nonzero c E€@ such that c,<c, c,<c. 
We shall prove that when @ is continuous and separable, Z does not 
have any pure states. 


Theorem 3.19. Let Z be any logic and let € be its center. Let P be the set of 
pure states of L. Suppose that G is discrete and that {a, : n e D} is the family 
of atoms of €. Then for each ne D, £[0,a,)=L£, is an irreducible logic. 
If p is a pure state of L, and we define p~ on L by 


(52) p-(a)=plara,) (ae), 
then p~ is a pure state of L. If 

(53) Pa ={p~ : p a pure state of £,}, 
then the {P : n e D} are disjoint subsets of P and 

(54) P=U Pa 


nEeD 


If € is continuous and separable, P is empty, i.e., Z has no pure states. 
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Proof. We have already noted that for any n, Y[0,a,]=/,, is a logic 
(corollary 3.3). We prove that Z ,„ is irreducible. In fact, let c<a, be such 
that c«> Z, Clearly, c++ a for all a<a,. On the other hand, as a, € G, 
a,«>6 for any beZ, so that b=(bAa,)V(bAa,") (cf. (36)). Now 
C 4> b Aa, as bAa, <a, while c++ 6Aa,* since c|bAa,*. Therefore (35) 
enables us to conclude that c <> b. This shows that ce € and hence that 
c=0 or c=a,,. This proves the irreducibility of Y,. 

Suppose next that p is a pure state of #,. Equation (52) defines p~ as 
a function on Z. From the equations (35) and (36) it follows that p~ is a 
state of &. We claim that p~ is a pure state of Y. Suppose that p~ =cp, + 
(1 —c)pa, where 0<c<1 and py, Pa are states of Z. Since p~(a,+)=0, 


pi(a,") = Polan) =0. 
Hence 7,(a,)=p.(a,)=1, and hence the restrictions of p, and p, to Z, 
are states of Y,. Since p is a pure state of Z» it follows that 


pla) = pi(a) = pala) 
for all a<a,. But then for any be Z, we see that p~(b)=p,(b) =p.(d) 
since, by (36), b=(bAa,)V(bAa,+) and p~, Pi, Pa vanish for b Aa,?. 
p~ is thus a pure state of L. If m, ne D and m4n, J, A P = Ø, for, 
if q E Pm and q’ E Pas g(a,)=1 while q’(a,,) =0. 
We show now that each element of F lies in some Y,. Suppose that p 
is a pure state of Z. We claim that p(a,)=1 for some n e D. Otherwise 
there will be some m e D such that 0< p(a,,)<1. Define q, and q, on L 


by 
Qi(@) = (P(4n))~*p(a A am), 
qala) = (1—P(am)) T P(A A amt). 


Since a,,€@, it can be easily verified using (35) and (36) that qı and q3 
are states of Z. Since q,(a,,)=1, and qa(dm)=0, q1 Æq. AS 


P = Plam) Qa + (1 —P(Am))- Ge, 
p is not pure, a contradiction. Therefore p(a,)=1 for some n e D. The 
argument given in the previous paragraph can now be repeated to enable 
us to conclude that the restriction of p to &,, say q, is a pure state of 
F, and that p=q~. This proves that p e F, and completes the proof 
of (54). 

We now come to the proof of the last assertion. Assume that ¢ is 
continuous and separable. Suppose that p is a state of Z. We assert 
first that for some c € @, 0<p(c) <1. If this is not true, then p(c)=0 or 1 
for each c e @. Since @ is separable, there exists a countable set 


GQ= {c1, Ca 3} 
such that 2 generates @. Since p(c,)=0 or 1, we may, by replacing c, by 


(55) 
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C,* if necessary, assume that p(c,)=1 for all n. Let co= Nn Cy. Clearly, 
p(Co) =1. In particular, co 40. We assert that co is an atom of @. In fact, 
the subset of all elements c in @ such that either c_| cy or c> co, is a sub-o- 
algebra of € containing all the c, and so coincides with @; this shows that 
if a<cy and a e @, then a=0 or a=Cy. This contradicts the assumption 
that ¢ is continuous. Consequently, our assumption that p is two-valued 
on @ is untenable. Let then d e be such that 0< p(d)<1. Define q, and 
qa on L by 


qi(a) = (pd) pla Ad), qala) = (1—p(@))~*p(a A d+). 
Then p= p(d)q, +(1—p(d))g2 and p is not pure. The proof of the theorem 
is complete. 


We shall end this section with a few remarks of a general nature regard- 
ing the Heisenberg uncertainty principle. We must point out that a 
considerable part of its meaning is physical and we do not go into this 
here (cf. the discussions of von Neumann [1] and Heisenberg [1]). We 
shall be concerned only with mathematical formulations. 

Suppose that © is a quantum mechanical system and #=¥(G) its 
logic. For any observable x associated with Z and any state p of Z, we 
write o,7(z) for the variance of x in the state p (defined only when 
6 (x?| p) < 00): 

(56) op (x) = &(x| p)—(F(z|p))?. 

If o,(x)=0, then the probability distribution of x in the state p is con- 
centrated at the point £ (x| p). x has then a sharply defined value in the 
state p. One might ask the question whether there are states in which 
every observable has, if not a sharply defined value, at least a small variance. 
The Heisenberg principle of uncertainty asserts that even this is not 
possible. Roughly speaking, it asserts that in any state, no matter how 
prepared, there are observables whose distributions have large variances. 
Notice that this is not possible if the logic is a Boolean o-algebra of subsets 
of a space. In fact, let & be the o-algebra of subsets of a set X. Then, for 
any observable with corresponding function f, the observable has a sharply 
defined value f(x.) whenever the state p is a measure concentrated in a 
point xo of X. On the other hand, let us assume that © is the physical 
system of some atomic particle and that x and y are the position and 
momentum observables along some direction in space. If we assume that 
the logic Y of © is the standard logic associated with the complex 
number field, then 


(57) o,(x)o,(y) > $h, 


where # is Planck’s constant divided by 2r (cf. H. Weyl [1] pp. 77 and 
393-394 for a rigorous derivation). From (57) it follows quite simply that 
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in any state in which the distribution of x has small variance (i.e., æ is 
‘localized ”), the distribution of y has very large variance. It is obvious 
that this is a property of the structure of the logic Y of ©. 


8. AUTOMORPHISMS 


Let Z be a logic, O the set of observables of Z, and SZ the set of all 
states of Y. The set of all automorphisms of ¥ forms a group in a natural 
fashion, denoted by Aut(¥#). We shall now introduce the notion of an 
automorphism of S. By a convex automorphism of S we mean a map 


(58) E:p>p i (PEP) 
such that 


(59) (i) £is one-one and maps S onto itself, 
(ii) ifci, Ca.» are numbers >0 with >, c,=1, and p,, Po ES, 


(2 CaP). = 2 Cn Pri- 


From (ii) of (59) it follows at once that convex automorphisms of F send 
pure states of Z into pure states of Z and that the correspondences 
induced in the set of pure states are one-one and onto. 

If 


(60) œ: a —> a" (ace L) 
is an automorphism of Y and if, for pe Z we define p* by 


(61) p"(a) = pia"), 
then p —> p“ is a convex automorphism of S. We shall call it the automor- 
phism induced by «. The set of all convex automorphisms of S also forms a 


group denoted by Aut( S). 
Let Z be a o-algebra of subsets of a space X, and let 


t : x —> t(x) (xex) 


be any one-one map of X onto itself such that for any set ASX, Ae L 
if and only if t7 1(4) e Y. When X and .Ẹ satisfy some technical regularity 
conditions, it can be shown that every sufficiently well-behaved auto- 
morphism of / is induced by a mapping t such as the one described above. 
Suppose X is the phase space of a classical mechanical system with a C® 
manifold M as its configuration space. X thus appears (cf. Chapter I) as 
the cotangent bundle of M. In this case, the subgroup of those auto- 
morphisms of # induced by (both ways) differentiable homeomorphisms 
of X onto itself and which preserve the canonical 2-form on X, plays un- 
doubtedly a very crucial role (contact transformations). 
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The axioms satisfied by an arbitrary logic are too weak to allow any 
reasonable determination of its automorphisms. In the next chapter we 
shall determine the automorphisms associated with standard logics. In 
the meantime, we shall briefly illustrate with two (among many) examples 
the important role played by the group Aut( S) in physical problems. 

As a first example we consider the problem of describing the dynamics 
of a physical system ©. Let Z be the logic of S, Z the convex set of all 
its states. Then, for each real number t, there exists a unique one-one 
map of F onto itself, say D(t), with the following physical interpretation; 
if pe F is the state of © at time tọ, then D(t)p is the state of © at time 
t +t. The dynamical group 2 where 


(62) 2? : t — D(t) 


satisfies the conditions (3) of Chapter I. We now postulate that each D(t) 
is in fact an automorphism of S, i.e., if p,, pe, -- - is a sequence of elements 
of SY and c4, C: a sequence of nonnegative numbers such that 
Cı +c +- =1, then, for each t, 


(63) Dt) (>, api) = 2 o,D(t)p,. 


This postulate may be justified by the following argument. Suppose one 
does not know the state at time 0 but only that it is one of p,,p.,--- with 
probabilities c,, c2,---, respectively. Then from the fact that the state at 
time t is D(t)p, if the state time 0 is p,, it follows that at time ¢ the state 
of the system © is one of D(t)p,, D(t)p2,--+ with the same probabilities 
C, Cg,:*+. But the statement that the state of Z (or ©) is one of q4, q2, ++: 
with probabilities c,, c,,--- is physically equivalent to the statement that 
the state of Z is cq; +Coge+---. This proves the validity of (63). It 
follows now that each D(t) leaves invariant the subset of pure states of Z 
and induces a one-one mapping on it. Furthermore, if ae Z, then, for 


each pe J, 
(64) t —> (D(t)p)(a) 


is a real valued function of t; it is physically reasonable to assume that 
for any ©, this function is Borel for each pe ¥ and a e &. If this con- 
dition is satisfied, we call 2 a Borel one-parameter group. Thus the dynamical 
group of S may be assumed to be a Borel one-parameter group of convex 
automorphisms of S. 

This leads us to introduce the concept of representations of groups. 
Let G be a locally compact topological group satisfying the second ax- 
iom of countability. The sets of the smallest o algebra of subsets of G 
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containing the open sets are called the Borel sets in G. By a representation 
of G in Aut( F) we mean a map 


(65) T:g—>T, (g EG) 
of G such that 


(i) each T, E Aut(S), 
(66) (ii) Ta, 92 = Ta Toa (91; J2 E G), 
(iii) for each ae # and pe Y, the function g —> (T',p)(a) is Borel 
on G. 


With these definitions we may say that the dynamical group of a system 
© is determined by a representation of the additive group of real numbers 
into Aut( S), where S is the set of states of the logic LZ of ©. 

As our second example, we consider the problem of describing the 
quantum mechanical theory of a free particle. The configuration space of 
the particle in classical mechanics is the three-dimensional space, denoted 
by M. Let Z be the logic of the physical system under consideration 
and S the convex set of its states. Suppose now an observer O wants to 
describe the system. He would then select a point of M, an orthogonal 
frame at O, and map M onto R? by a map 


(67) yo : M > R. 


Let O’ be another observer and let yọ- be the corresponding map. Then 
there exists a unique Euclidean motion Do,o of R? such that for all 
meM 


(68) yom) = Do,o(Yo(m)). 


Suppose now O describes the state of the system © by an element po E S. 
Then, O’, using his orientation, would describe the same physical state 
by an element po, of S. It is clear that the map Po —> Po is an auto- 
morphism, say &o,o of F. In other words, «o,o: has the following physical 
interpretation: if the physical state of © is described by O by the element 
gE S, then «o,o(g) is the element of S describing the same physical 
state of © from the point of view of 0’. It is obvious that «o,o is the 
identity for O=O’, and 


(69) &o,0" = @o',0o"%o0,0" 


We now remark that the basic physical laws of © are independent of 
` the choice of observers and their frames. Therefore, we may conclude that 
&o o depends only on the transformation Do o-. Let G be the group of all 
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Euclidean motions of R?. Then, from (68), (69), and the above remarks 
on the «o.o, we infer the existence of a homomorphism 


(70) T:g—>T, 


of G into Aut() with the property that whenever O and O’ are two 
observers of S, 
(71) %0,0° = Tro,0" 


We may reasonably make the assumption that for each a e # and pe YX, 
g — (T,p)(a) is a Borel function on G. T is thus a representation of G in 
Aut(S). 

We see thus that the description of © gives rise to a representation of 
G in Aut( S). The determination of such representations is therefore the 
basic problem to be solved in the mathematical description of the 
physical system of a free particle. 

The reader might notice that the arguments given above are very 
general. In fact, the same type of arguments lead to the remarkable fact 
that with every relativistic free particle there is associated a representation 
of the inhomogeneous Lorentz group. These representations were first 
completely determined by Wigner under the assumption that the logic of 
© was standard. This led in turn to his celebrated classification of the 
relativistic wave equations [1]. 


NOTES ON CHAPTER III 


1. As a general reference on lattice theory we refer to the book of 
F. Maeda and S. Maeda, Theory of Symmetric Lattices, Springer-Verlag, 
Berlin, 1970. 


2. There has been a good amount of work by many people in recent years 
on the structure of logics and probability measures on them, motivated 
mostly by the relevance of such problems to the foundations of Quantum 
Mechanics. The encyclopedia volume of E. G. Beltrametti and G. Cassinelli 
(The Logic of Quantum Mechanics, Volume 15, Encyclopedia of Mathematics 
and Its Applications, Addison-Wesley, Reading, Mass., 1981) gives an 
excellent overview of this work (Chs. 10-20). In addition, this book presents 
a well-organized, comprehensive discussion of the foundations of Quantum 
Mechanics from different points of view with many references to current 
literature, examples, and exercises. Especially illuminating are the critiques 
of controversial topics such as hidden variables (Chs. 15, 25) consistency of 
physical interpretation (Chs. 7, 8), the process of measurement (Ch. 26), 
and so on. As additional references we mention the following: 

J.M. Jauch, Foundations of Quantum Mechanics, Addison-Wesley, Reading, 

Mass., 1968. 
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C. Piron, Foundations of Quantum Physics, Benjamin, Reading, Mass., 1976. 

M. Jammer, The Philosophy of Quantum Mechanics, Wiley, New York, 1974. 

C. A. Hooker (Ed.), The Logico-algebraic Approach to Quantum Mechanics, 
Vol. 1, Reidel, Dordrecht, 1975. 

E. G. Beltrametti and B.C.van Frassen (Eds.), Current Issues in Quantum 
Logic, Plenum, New York, 1981. 


3. The structure of the states of orthocomplemented continuous 
geometries is still mysterious. One of the most striking results is due to von 
Neumann who showed that in the presence of a transition probability 
function with suitable invariance properties, such a lattice is nothing but 
the lattice of projections of a type II, factor (Continuous geometries with a 
transition probability, Mem. Amer. Math. Soc. No. 252, Providence, R.I., 
1981). 


4. It is possible to determine, in the standard model, the states for which 
the lower bound of the Heisenberg uncertainty relations (57) is attained, 
for the quantum mechanical system of a one-dimensional particle. Let 
HAH = L? (—00,00), 


hd 


g=operator of multiplication by x, p = 7 de’ 


then 


bol J 


09(J)* oolp) = 


for the family of states p=¢,,,,,(y,0,0€R, y>0) given by 


p(z) = (Z) vi- (y/2h) (x—0)?+ (p/h)ia}; 
then 
colg) = (4/2), agp) = (hy/2)} 
(cf. von Neumann [1], pp. 233-237). 

As we shall see in greater detail in Chapter IV, any vector g of unit norm 
in L?(—co,00) defines a pure state of our system; the probability for 
SeL(H#) is ||\PSo||?, P being the orthogonal projection # -> S: g is called 
the wave function, and since the spectral measure of q is E >qz where qe 
is multiplication by the characteristic function of Æ, the probability distri- 
bution of q in the state g is the measure 


pi ko i |p|? dx. 


Thus the wave function allows one to calculate, by this formula, for any 
region Ẹ c€ (—%,0), the probability (E£) of finding the particle inside Æ. 
Tbis is the simplest instance of the statistical interpretation of Quantum 
Mechanics which is due to Max Born (Z. Physik, 37 (1926), pp. 863-867). 


CHAPTER IV 
LOGICS ASSOCIATED WITH HILBERT SPACES 


1. THE LATTICE OF SUBSPACES OF A 
BANACH SPACE 


The general theory that was developed in Chapter III is concerned 
mainly with the broad principles of Quantum Theory. Further develop- 
ment of the theory, such as a discussion of simple quantum mechanical 
systems, leads to problems of a more technical nature. These problems 
are, however, difficult to answer in the context of abstract logics, and 
therefore, in dealing with them, it becomes necessary to restrict the class 
of logics under consideration. In this chapter we shall examine some of 
these logics and describe, in a somewhat more concrete fashion than in 
Chapter III, the observables, states, and symmetries associated with them. 

We have seen in Chapter II that a large class of orthocomplemented 
geometries are obtained by considering Hilbert space structures on finite 
dimensional vector spaces. These logics are, however, not of much use in 
discussing even very simple atomic systems, since every observable 
associated with them is finite. Consequently, if one wants to obtain 
realistic examples, one is forced to deal with infinite dimensional vector 
spaces. 

It is known (cf. Baer [1]) that the lattice of all linear manifolds of an 
infinite dimensional vector space does not admit any polarity. Therefore 
it becomes necessary to consider lattices Z such that the elements of L 
are certain linear manifolds of a given vector space V and, wherein, the 
partial ordering is the inclusion relation. It must be noted that the lattice 
sum of two elements of such a lattice contains, but in general does not 
coincide with, the algebraic sum in V. A simple and interesting class of 
such lattices may be obtained in the following way. Let D be a topological 
division ring, say, for example, one of R, C, or H. Let V be a topological 
vector space over D (cf. Bourbaki [2] for the definitions and elementary 
properties). Then, the set of all closed linear manifolds of V is a lattice—in 
fact, a complete lattice. One may then construct examples of logics whose 
underlying lattices are these lattices of closed manifolds. The description 
of such logics would provide us with a large class of examples of logics 
which are complex enough to serve as models for the logics of atomic 
systems. 
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The main purpose of this section is to prove a theorem of Kakutani and 
Mackey ([1]), which asserts that if V is a Banach space, the above pro- 
cedure leads to no new examples of logics other than the standard ones. 
In particular, if the underlying lattice of a logic is isomorphic to the 
lattice of all closed linear manifolds of a separable infinite dimensional 
Banach space over one of R, C, or H, then the logic is standard. 

Let D be one of R, C, or H and V any vector space over D. We shall 
say that a topology 7 on V is compatible with the vector space structure 
of V if it is Hausdorff, and, if the map, a, b,z,y>a-2+b-yofDxDxVxV 
into V, is continuous. A topological vector space over D is a pair (V,7), 
where V is a vector space over D, and 7, a topology over V compatible 
with its vector space structure. By the usual abuse of language we shall 
say that V is a topological vector space. We shall denote by ¥(V,D) the 
lattice of all closed linear manifolds of V. This is a complete lattice. If we 
denote by v and A the lattice operations in #(V,D), then, for any 
family {1,} of closed linear manifolds of V, we have: 


(1) A Ma = N Mw 
(2) VM. = (2 M.) 


where > denotes the algebraic sum, and the bar denotes closure. If N is 
any finite dimensional linear manifold of V, then N is necessarily closed 
and for any closed linear manifold M, Nv M =N +M. Moreover, if M is 
a closed linear manifold whose codimension (=dim V/M) is finite and N 
any algebraic complement of M, i.e., such that MA N=0, M +N=V, 
then N is necessarily closed and V =M v N (for these facts cf. Bourbaki 
[2], Ch. 1). 

We shall recall the notion of Banach spaces. Let D be one of R, C, or H 
and V a vector space over D. A norm over V is a function x —> ||| of V 
into the real numbers such that 


(i) |v] > 0,=0 ifandonlyif x= 0, 
(3) Gi) le+yl] < lel+l|y] for z,yeF, 
(ii) cal] = |c|-|z] for ceD,xeF. 


A Banach space over D is a pair (V, ||- ||), where V is a vector space over 
D and ||- || a norm on V such that V is complete under the metric d 
defined by d(x,y)= ||x—y||. A Banach space over D is obviously a topolog- 
ical vector space over D. 

We are now in a position to formulate the theorem of Kakutani and 
Mackey. We recall the notion of Hilbert spaces and inner products 
associated with D (cf. Chapter II, section 4). 
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Theorem 4.1 (Kakutani-Mackey [1]). Let D be one of the three division 
rings R, C, or H, V an infinite dimensional Banach space over D, and 
L=L(V,D) the lattice of all closed linear manifolds of V. Suppose that 
| (M — M+) is an orthocomplementation of L. Then there exists a D- 
valued inner product <.,.)> on V x V such that (i) V becomes, under <., .>, 
a Hilbert space over D; (ii) the topology of V, induced by the norm associated 
with <.,.>, coincides with its original topology; and (iii) the map M —> M+ 
coincides with the orthocomplementation induced by <., .>. In particular, if 
the underlying lattice of a logic is isomorphic to the lattice of closed linear 
manifolds of a separable Banach space of infinite dimension over D, then 
the logic is standard. 

The proof is rather long and depends on a series of lemmas. With later 
applications in mind we formulate our first lemma somewhat more 
generally than is necessary for our immediate purposes. 


Lemma 4.2. Let K be a division ring and W a vector space of infinite 
dimension over K. Let 4 be a lattice of linear manifolds of W such that 
(i) M contains all finite dimensional linear manifolds of W, and (ii) if 
M,N e Mand at least one of them is finite dimensional, then Mv N=M +N. 
Suppose M(M —> M+) is an orthocomplementation in M. Let woe W be 
any nonzero vector. Then there exists an involutive anti-automorphism @ of 
K and a symmetric 0-bilinear form <. , .> on W x W such that (i) <wo,Wo> = 1 
and (ii) <x,y> =0 if and only if xe (K-y)+-6 and <., .> are uniquely deter- 
mined. The form <., .> is definite, i.e., (2,2) =0 if and only if x=0. 


Proof. Let <= W be a finite dimensional linear manifold. Then F Ee MA. 
For any linear manifold L& F we have L eM also. We now define 


(4) é(L) = LAF. 


Note that if M is finite dimensional and N e M, then MAN=MQNON. In 
fact, MANSMAN always while the finite dimensionality of MAN 
implies, by (i), that MON e M, so that MONSM AN. 

Let Mp be the projective geometry of all linear manifolds Dc F. £p is 
a map of Ap into itself. We shall show first that p is an orthocomple- 
mentation of Æp. Clearly £,(0)=F, &(F)=FtnF=F' a F=0. Since 
M —> M+ is order inverting, it is obvious that p is order inverting. We 
observe next that for Le @,, DNEé(L)CLAL+=LAL*+=0. On the 
other hand, W=Lvy L+=L+ L+ so that F=L+€,(L). Thus L and 
ép( L) are complements of each other in Æp. To complete the proof that £p 
is an orthocomplementation, it remains to prove that ép is involutive. 
For Le Mp, épr(L)S Lt so that Lc €,(L)*. This shows that LS €,(L)'n 
F=€,(&;(Z)) for all Le Mp. Let L'=€,(L). Then, by what we proved 
just now, £,(L’) is a complement of L’ in Æ p. But L is also a complement 
of L'’=&,(L) and we have seen that Dcé,(L’). This shows that L must 
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coincide with £,(L’). We see thus that ép is an orthocomplementation 
of Mp. 

We shall fix a nonzero vector wy e W. Let F be the class of all finite 
dimensional linear manifolds Fc W such that (i) wọ e F, (ii) dim F>3. 
By theorem 2.7 there exists, for any FeF, an involutive anti-automor- 
phism fp of K and a nonsingular symmetric 6,-bilinear form <.,., on 
Fx F such that <.,.>, induces ép; i.e., for L, 


(5) €-(L) = {u : ue F, zu, =0 for allze LD} 
and 
(6) (Wowo) = 1. 


Moreover, (6) determines 6, and <.,.Yp uniquely. Suppose now that 
F,, Fae F with FiS F}. Then, for any Le M p,» 


Ep, (L) = L+ A F, = {ér (L)} N F. 
This shows that for any L E€ Mp, 
(7) fr, (L) = {u : ue Fy, (z,u>,, = 0 forall ze LZ}. 


In other words, the restriction of <. , .Xp, to F, x Fy isa 6,,-bilinear form 
which also induces the orthocomplementation £p, . In view of the normaliz- 
ing condition (6), we may conclude at once that 


Op, = Op, , 


(8) 
Yr, = Eyr, (x,y eF). 


Since any two elements of F are contained in a third element of F, we 
easily deduce from (8) that there exists an involutive anti-automorphism 
0 of K, and a symmetric -bilinear form <.,.5 on Wx W such that 
(i) 0=0p for all Fe F, and (ii) the restriction of <.,.ẹ to Fx F is 
{.,.)F for all Fe F. Lemma 4.2 follows at once from this. 

We now resume our discussion of the lattice Z of all closed linear 
manifolds of the Banach space V over D. We take V=W in lemma 4.2 
and obtain an involutive anti-automorphism 0 of D and a definite 
symmetric 6-bilinear form <¢., .> on V x V such that <x,y) =0 if and only 
if x| y. We fix a nonzero vector v€ V; then 0 and ¢.,.» are uniquely 
determined by the condition <v),v))=1. 

Let V* be the set of all continuous linear maps of V into D. Then V* is 
a vector space over D°, the division ring dual to D. For we V, let 8, be the 
linear map of V into D defined by 


(9) Pule) = (ur (we V). 
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We shall also use the conjugate norm on V*; this is defined by 


(10) JA] = sup Aw] (Ae V*). 
It is well known that V*, under this norm, is a Banach space over the 
division ring D°. 


Lemma 4.3. B, e V* for any ue V. Moreover, u —> B, ts an additive iso- 
morphism of the abelian group V onto the abelian group V*. 


Proof. It is a well known fact (cf. Bourbaki [2]) that a linear map A of 
V into D is continuous if and only if the linear manifold 


(11) Z,={x:x2EV, A(x) = 0} 


is closed in V. Let now A=8,, and write Z, for Z,, ue V. If ye V, ye Z, 
if and only if <y,u>=0, i.e., if and only if y | u. Hence 


(12) Z, = (Deu). 


But (D-u)+ e Z and is therefore closed by assumption. This proves that 
B, e V*. The map u — 8, is clearly additive. If, for ue V, 8, =0, then 
<u,u» =B,(u)=0; this implies that u=0. In order to complete the proof 
of the lemma it remains to prove only that u —> p, maps V onto )*. 
Suppose Ae V* and #0. Let Z, be defined by (11). Then Z, € # and 
Z, Æ V. Therefore Z,+ 40. Let u be a nonzero vector in Z,+. Since <x,u> =0 
for all x € Z,, u vanishes over Z,; and since u#0, 8,40. Since Z, and 
Z, are both of codimension 1 and Z,¢Z,, we infer that Z,=Z,. Let 
xo E V be such that A(x) =1. Since xo € Zy, £o ¢ Z, and hence 


Bu(Xo) = [ToU Æ 0. 


Let c=<xp u) t and let us write ug =c°u. Then for xe V, 
{X,Uo> = (X,UDC. 


This shows that B,,,(%)=1=A(%p) and implies that B,, =A. 

The main question at this stage is whether @ is continuous. The non- 
trivial case is when D=C and we handle it first. It is at this point that 
essential use is made of the infinite dimensionality of V. 


Lemma 4.4. Let W be a complex Banach space of infinite dimension. 
Then one can find a sequence of vectors x,,%,--- E W with the property 
that, for any bounded infinite sequence {c,} of complex numbers, there exists 
a continuous linear functional À on W such that 


(13) A(%_) = Cy (n = 1,2,---). 
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Proof. Let ||-|| be the norm on W. Let W* be the space of all complex 
valued continuous linear functionals on W. We define x, e W to be an 
arbitrary vector of norm 1 and choose x,* e W* such that 


[z *] = 1, 
x(x) = l. 


The celebrated Hahn-Banach theorem ensures that such an 2,* e W* 
exists. We shall now define by induction two sequences {z,,} and {x,*}in W 
and W*, respectively. Suppose %1, %,:--, 2,6 W and 2,*,---,2,* E W* 
have been already constructed so that 


rt (x) = ôy (Kronecker delta) (4,7 = 1, 2,---,), 


14 
(14) a*|| = 2-6-D (i = 1,2,--+, n). 


Since W is infinite dimensional, there is a nonzero vector x,,, in W 
linearly independent of 2,,--+-, 2, such that 


©" (%n41) = 0 (2 = 1,2,---, n). 


Using the Hahn-Banach theorem once again, we can find x*,, e W* such 
that x*, 1(%,41) 40, rž, (xı) =0, 1<n. Replacing x, ,, and x*,, by suitable 
multiples, we may assume that |7*,, || =2-", c*,1(%,41) =1. The sequences 
Lis Zo **'s Ung, and 2,*,---,2*,, then satisfy (14) with n replaced by 
n+1. By induction it follows that there exist infinite sequences {z,} and 
{x,*} such that x, e W for all n, z,* e W* for all n, and 


x,*(2;) = Ôi; (1,9 = l, 2, ee -), 


15 
(15) jat] = 274-9 (§ = 12+). 


We claim that the sequence {z,,} has the properties asserted in the lemma. 
In fact, let {c,} be any bounded sequence of complex numbers. For x e W, 
let A(x) be defined by 


(16) Na) = > opea*(a), 


The inequality |x,*(x)| < ||z,*||- |æ] =27%~ zl] shows that A is well 
defined and that x —> A(x) is a continuous linear functional on W, i.e., 
Ae W*. Clearly, A(x,)=c, for all n. 

We now resume our discussion of V and 8. 


Lemma 4.5. If D=R, 0 is the identity; if D=C, 0 is the conjugation; and 
if D=H, 0 is the canonical conjugation. 
Proof. Let F be any linear manifold of finite dimension >3 containing 


v The map L-> L+ 1 F is, as we saw in the proof of lemma 4.2, an ortho- 
complementation in the geometry of subspaces of F and is, moreover, 
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induced by ¢.,.)# (the restriction of (.,.) to F x F). From theorem 2.8 
we deduce that @ is the identity if D =R and is the canonical conjugation 
if D=H. Suppose now that D=C. Clearly, as (z,7)r=0 for an xeF 
implies that x=0, 0 is not the identity. @ will therefore have to be the 
conjugation in this case, provided we show that it is continuous. Since 
8 is additive, this reduces to showing that if c, —> 0 in D, {c,°} is a bounded 
sequence. Suppose then that D=C and that 0 is not the conjugation. 
Then there exists a sequence {c,} in D such that 

Ca —> 0, 
(7) |c,°| —> 00. 
Since dim V = œ, we can select a sequence {z,} in V such that it has the 
property described in lemma 4.4. Let 8, be the element x->(z,z,) of 
V* (cf. lemma 4.3). Since |c,°|-> 00, we may, by passing to a subsequence 
if necessary, assume in addition that for all n, 


(18) len] > n: |IBall- 


By lemma 4.4 we can find a àe V* such that A(z,)=c, for all n. Evidently, 
\0. Moreover, it follows from lemma 4.3 that there is a u40 in V such 
that A=B,. Then <u,u> 40 and <z, u) =c, for all n. If 


(19) Zn = CU,UX TIU — Cp” Ep, 
then 

À(Zn) = Zn u) = 0 
for all n. But then, as <., .> is symmetric, 


LU, Zn = 0 
for all n, i.e., 


(20) (Cu?) *Cu,w> — (Cn?) Ku, tn> = O. 
Since |(c?) t<, £Y] < |6| 71au], (18) implies that 
(Cn) CU, Eny > 0  (n—> 00) 


and hence we infer from (20) that (u,w>(<u,u>*)~+=0, which is impossible 
as u#0. This contradiction proves that 0 must be the conjugation and 
completes the proof of lemma 4.5. 


Lemma 4.6. (u — B,) is a continuous map of V onto V*. 


Proof. Since we may obviously consider R as a subfield of D as well as 
D°, V and V* may both be regarded as Banach spaces over R, denoted 
by Vr and V,*, respectively. From lemma 4.5 we conclude that 
B(u — B,) is an R-linear map. As £ is defined on the whole of Vz, we may 
use the closed graph. theorem to reduce the proof of the continuity of B 
to the proof that £ is a closed map. Let {z,} be a sequence in Vpr such that 
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Za —> x and By, —> A. We must prove that A=f,. Since $ is onto, we may 
assume that A=f,, for some we V. From the formula (10) for the norm 
in V*, it follows that B,,(v)>f,(v) for all ve V. Since, by lemma 4.5, 
6 is continuous, we have, for ve V, 


<x wy = him <x, Y) 
= lim (Koea) 
= Cony 
= (uwy. 
This proves that x =u or 8, =A. We have thus proved lemma 4.6. 


Lemma 4.5 shows that {.,.} is an inner product on V x V. Thus, on 
defining ||z||,2=<z,z>, we obtain a norm on V. V is a pre-Hilbert space 
with respect to ¢.,.>. 


Lemma 4.7. V is a Hilbert space under <., .>. 


Proof. We must prove that V is complete under ||: ||). Let {z,} be a 
sequence in V such that ||z, —2m||)? —> 0, n, m — œ. Then <y, £n — £m? —> 0 
as n,m-—>oo for each ye V. Let A(y)=lim, <y,2,>. Then A is a linear 
map of V into D. Since B,, € Vy* for all n, we may apply the well known 
uniform boundedness principle to Vp, a Banach space over R, to conclude 
that àe V*. By lemma 4.3 there exists a ze V such that A=8,. We claim 
that |z,—z||) > 0. Let e>0 be arbitrary and N be such that 


||z,—Zmallo Se 
for n, m> N. Then, for ye V, 


KY, En~ Em?) < ellyllo (n,m z N) 
and hence 


KY £a — | < elylo (n zN), 
which shows that lz„— zlo <e for n> N. 


Lemma 4.8. The topologies induced on V by ||- || (the original norm) and 


I- lo coincide. 

Proof. Let y, x1, £2,- - be elements in V. We must show that as n — œ, 
lza — yllo — 0 if and only if ||z,—y|| > 0. Since V is a Banach space over 
R under both ||-|| and ||- llo, it is sufficient, in view of the open mapping 


theorem, to show the implication in one direction only. Let then 
|x, —y|| > 0. Since the map £ from V to V* is continuous, Bz, —B, —> 0 
in V* so that 

(21) sup |<z,t,-y>|>90 (n> 0). 


heist 
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Now, |x, —y|| <1 for all sufficiently large n and hence we infer from (21) 
that 


lz, —Yllo” = (tn y, En — Y) —> 0, 


which is the relation required. 


We can now complete the proof of theorem 4.1. V is a Hilbert space 
under <.,.ẹ and the topology induced by the corresponding norm has 
been shown to coincide with the original topology for V. Consequently, 
the same linear manifolds are closed in both topologies. Let £(M —> &(M)) 
be the orthocomplementation of the Hilbert space V. Now, <y,2>=0 if 
and only if x {| y, and hence for any M e Y, ye M+ if and only if y e &(M). 
This proves that | and é coincide. The proof of theorem 4.1 is complete. 

Theorem 4.1 shows that a logic may be asserted to be standard as soon 
as it is identified as a member of an apparently more general class of lattices. 
We have proved theorem 4.1 to emphasize our point of view that the 
important role played by standard logics is not due to accident but stems 
from deeper mathematical reasons, and that theorems such as the present 
one lead to a clarification of some of these reasons. This said, the stage is 
set for the detailed analysis of the standard logics and we proceed to do 
it now. 


2. THE STANDARD LOGICS: 
OBSERVABLES AND STATES 


We now proceed to describe the structure of the set of observables and 
the set of states associated with a standard logic and examine the manner 
in which the general concepts of Chapter IIT specialize. Throughout this 
section we write D to denote one of R, C, or H. Let # be a separable 
infinite dimensional Hilbert space over D. We write Z for the logic 
L(#,D) of all closed linear manifolds of # (with its canonical ortho- 
complementation). 

For any closed linear manifold M of #, let us write P™ for the orthog- 
onal projection on M. If f(E + f(£)) is any observable associated with 
the logic Z, then the map E > P! is a projection valued measuret based 
on Z(R?). Conversely, it is obvious that to any projection valued measure 
P(E -> Pz) based on A(R?) there exists a unique observable f(E -> f (E)) 
such that P,;= P/™ for all real Borel sets E. 


+ The concept of a projection valued measure is nothing new; it is a standard 
tool of spectral theory. For complex Hilbert spaces we refer the reader to sections 
35-38 of Halmos [2]. For the real case very few changes have to be made. For the 
quaternionic case and for projection valued measures defined on the real line, the 
theory is essentially the same, since the real field is the center of H. We shall assume 
the basic facts concerning projection valued measures as known. 
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Lemma 4.9, Let M,,M,¢ Z. Then M, +> M, if and only if the pro- 
jecttons P™1 and P™2 commute. 


Proof. Let M,, Ma Ee L. Assume first that M, <> M,. Then there exists 
mutually orthogonal elements N, Nao N of # such that M,=N,+N 
(¢=1,2). Clearly, P”i=P%1+P%, and one is led immediately to the 
equation PMıP™M2= PM2P™:, Conversely, let P@1 and P™M2 commute and 
let P= P™1P™2, Then P is a projection. If we write Q= P™~:— P, then it 
is easily verified that Q; is a projection (t=1,2) and PQ,=Q,P=0, 
QQ- =Q-Q; =0. If N,, No, and N are the closed linear manifolds which are 
the ranges of Qi, Qo, and P, respectively, then it follows easily that 
N,, Nz, and N are mutually orthogonal and M,=N,v N (1=1, 2). This 
proves that M, «> Mg. 


Lemma 4.10. Let X be a set and Z a o-algebra of subsets of X. Let 
P(E —> P,) be a projection valued measure in # based on B and F the 
algebra (over R) of all real valued bounded measurable functions on X. For 
geF let g| =sup{|g(x)| : xe X}. Then, for any ge F, there exists a 
unique, bounded, self-adjoint operator B, on # such that 


(Buv) = f, g(x)dv, sla) (m, vE), 
(22) 


(2, = Í gdP, ùn symbols) 
x 
where v, „ is the D-valued measure on B defined by 
Vyvo(L) = CP,ru,v>. 


The spectral measure of B, is the projection valued measure E —> P,- 1 (py. 
One has: 


(23) [IB < lg] (EF). 


The correspondence g —> B, is a homomorphism from the algebra F into 
the algebra (over R) of all bounded operators on #. In particular, B,, and 
B,, commute for all g1, 92 €F. 


Proof. The proofs are similar to the ones in section 37 of Halmos [2] 
and we omit them. In the quaternionic case we have to use the fact that 
the real field is the center of H. 


Theorem 4,11. Let D be one of R, C, or H and Ẹ the logic of all closed 
linear manifolds of H. For any observable f(E —> f(E)) associated with 
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L, let A, be the self-adjoint (not necessarily bounded) linear operator with 
spectral measure E —> P!®, Then 

(24) fA, 

is a one-one correspondence between the set of all observables associated with 
SL and the set of all self-adjoint operators on Æ. The observable f is bounded 
if and only if the operator A, is bounded. Two bounded observables f, and fo 
are simultaneously observable if and only if the (bounded) operators A,, and 
A,, commute. If f is any bounded observable and p is a polynomial with real 
coefficients (in one variable), then the operator corresponding under (24) to 
pof is p(A,). More generally, if f,,---,f, are bounded observables, any two 
of which are simultaneously observable, and if p is a real polynomial of r real 
variables, then the observable po(f;,...,f,) (cf. theorem 3.17) has the corre- 
sponding operator p(A,,, ...,A;,). 


Proof. The first assertion concerning the correspondence f— A, is an 
immediate consequence of the spectral theorem. Next, an observable f is 
bounded if and only if for some compact set K ¢ R!, f(K)= 2. This is 
possible if and only if P = I, which in turn is well known to be equivalent 
to the requirement that A, be bounded. Let now f, and f} be bounded 
observables. Then f, and f, are simultaneously observable if and only if 
fi(E,) < f(E) for all pairs of Borel sets E, and H,. From lemma 4.9 and 
standard spectral theory, it now follows that f, +> f, if and only if A,, and 
A,, commute. We now come to the last pair of assertions. Since the first 
of these is a special case of the second for r=1, we shall confine ourselves 
to the proof of the second. Let f,, ---, f, be bounded observables such that 
fix f, for all ¢,j7=1, 2,---,r. By theorem 3.17 there exists a c-homo- 
morphism F —> f(f) of @(R’) into Z such that for any i (l<1<r) and 
any He &(R}), 


m; being the projection (t ''',t,) >t, of R” onto R!. Let Pp = PP. 
Then F -> Pp is a projection valued measure on &(R’). Since the f; are 
bounded observables, there exists a compact set K €R! such that 
f(K)=# for i=1, 2,---,r. We now apply lemma 4.10 by taking X= K’, 
B =B(K'"). For any E e BK), Pa- 1 = PP, and so, writing 

Vy F) = (Pru,v), vi, (E) = (PLO uv), 


we get 


(Bru) = f mm (ty, to, a) t)dvy olti; mr, t,) 


= [tite 


= (A uw). 
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Thus, B,, =A;,,+=1, 2,---, r. Since g —> B, is a homomorphism, we have, 
for any real polynomial p of t,,---, t, 


B, = p(B,,° oe, B,,) = p(A,,,° . +, A; ). 


But the self-adjoint operator B, has the spectral measure E —> P,- 1%) 
by lemma 4.10. B, therefore coincides with the operator corresponding 
to the observable E —> f(p~1(£)) which is precisely p o (fi, +*+, fr). This 
proves the theorem completely. 


Remark. That fı <> fz is equivalent to the equation A,, A,;,=A,,A,, is 
a celebrated result of von Neumann. The fact that p o (f,,---, fr) corre- 
sponds to the operator p(A,,,---,A,;,) is remarkable since it links the 
algebra of functions of f,,---, fr with the algebra of operators on #. 


We now take up the problem of determining the states of the logic 2. 
Our basic result is the theorem of Gleason [1] which asserts that every state 
of £ can be described in a canonical way by what is usually known as a 
density matrix. We shall now proceed to a detailed discussion of this 
theorem. 


To any vector ue # with |u| =1, we associate the mapping p, of Z 
into the set of nonnegative real numbers by defining 


(25) pM) = <P!ujuy = |PMul? (MeL) 


If M=H#, p,(M)=||u||?=1, while p,(0)=0. If M,, M2, +- is a sequence 
of elements of Z such that M, | M, (tj) then, writing M =V, M, we 


have: 
Pal M) = | Pu]? 


= > [Pl 
= È, p.M). 


In other words, p, is a state of Z. It is clear that if ce D and |c|=1, 
Peu “Pu 

The general description of states depends on the concept of an operator 
of trace class. On complex Hilbert spaces they are defined and treated 
in many books on Hilbert space theory (cf. Dunford and Schwartz [2], 
for example). The treatment in the real and quaternionic cases is similar. 
A bounded operator A of # is of trace class if, for any orthonormal basis 
{e}, the series 


> |«Ae,,e,>| < 0; 
t 


then the sum 


tr(A) = 2. (Aee 
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exists for any orthonormal basis {e,} and is independent of the basis used ; 
it is called the trace of A. If A is of trace class and B is any bounded 
operator, AB and BA are of trace class; and moreover, 


tr(A B) = tr(BA). 


If A and B are of trace class, so are aA +8B where a, $ are in the center 
of the division ring over which # is defined. We shall assume the reader 
to be familiar with the basic properties of these operators. 

The states of the form p,(ue#, |u|] =1) can also be described in 
another way. Let U be the projection on the one-dimensional linear 
manifold D-w. Then, for any bounded operator A, AU is a bounded 
operator, which is of trace class, since U is (trivially) of trace class. Let 
{e,} be an orthonormal basis for # with u=e,. Then 


tr(AU) = tr(U A) 


(26) = > (AVE: 

i 

= (Auu). 

In particular, 
(27) p,(M) = tr(P*U). 
Suppose that w,, uz,» is any sequence of vectors in # with |u| =1 for 
all i. Let a,,@2,--+ be a sequence of nonnegative numbers with 
Q,+Q,+---=1. Let U, be the projection on the one-dimensional manifold 
D-u,. Then 
(28) U =) aU, 

i 


is a well-defined, bounded, nonnegative, self-adjoint operator of trace 
class and 


(29) tr(U) = 1. 
If M e L, we have 
tr(P”U) = > a, tr(PMU,) 
{ 
= > Ai Pu, (M). 
i 


Thus the function py defined by 
(30) Py(M) = tr(P™U) 


is a state of ¥ and in fact is the state >,c,p,,. Conversely, let U be a 
bounded, self-adjoint operator such that (i) U is nonnegative, i.e., 
<(Ux,xy 20 for all xe #, and (ii) U is of trace class and tr(U)=1. Then, 
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the function py defined by (30) is a state of Z. We call the operators U 
which are bounded, self-adjoint, nonnegative, and of trace class, von 
Neumann operators. 

The question naturally arises whether every state of Y is of the form 
Py for a suitable von Neumann operator U of trace 1. It was Gleason [1] 
who first proved that this is indeed so. His result, which is a cornerstone 
of the mathematical] foundations of quantum mechanics, is undoubtedly 
one of the most profound theorems in this subject. 

Gleason’s theorem is quite complicated to prove and we need to develop 
a number of technical lemmas. We begin with the concept of a frame 
function. Let & be the unit sphere of #7, i.e., 


(31) E = fx: ze H, |x| = 1}. 
A real valued function f(x —> f(x)) defined on @ is a frame function if 


(i) f(cx) = f(x) for xef and ceD with f| = 1, 
(ii) there exists a constant W such that >, f(e,)=W for any ortho- 
normal basis {e,} of # (recall that # is separable). 


(32) 


The constant W is called the weight of f. 
Suppose that U is a bounded, self-adjoint operator of trace class. Let us 
define the function fy by 


(33) fuls) = <Uxe> (wee). 


Then fy is obviously a frame function whose weight is tr U. A frame func- 
tion f is called regular if there exists a bounded, self-adjoint operator U 
such that 


(34) f= f Us 


Clearly, if f is regular, there exists only one bounded, self-adjoint operator 
U such that (34) holds; for, if U and V satisfy (34), <(U — V)xz, xy =0 for 
all x e # and hence U -— V =0. 

Suppose that p is a state of #. For any vector x e & let us define 


(35) Jal(2) = w(D-z). 


Then f, is a nonnegative frame function of weight 1. If U is a von Neumann 
operator of trace 1 and u=Py the state defined by (30), a simple calcu- 
lation shows 


(36) fa = fo 


i.e., f, is regular. Conversely, if » is a state of Z and if we assume that 
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f.=Ju for a bounded, self-adjoint operator, then it can be shown that U 
is a von Neumann operator of trace 1, and u=py. In fact, as 


(Uxz,x»> = f,(x) 2 0, 


U is nonnegative; then, if {e,} is any orthonormal basis of #7, 
> (Ue, en» = > Salen) = | 


so that U is of trace class and has unit trace. This shows that the states 
p and py coincide for all one-dimensional subspaces of # and hence 
coincide for all elements of &. In other words, Gleason’s theorem is 
entirely equivalent to proving that every nonnegative frame function is 
regular. 

We shall first examine the special case when D=R and when # is 
finite dimensional. If # is the real line, then the only frame functions are 
constants. If # is the plane R?, then it is obvious that a frame function 
can be arbitrarily defined on any quadrant of the unit circle and 
hence need have no special properties. 


Lemma 4.12. Let # = R?, the plane (D = the real field) and let & be de- 
scribed as 
(37) € = {(cos 0, sin 6): 0 < 0 < 27}. 
Then the function f : (cos 8, sin 6) — cos n8, n being an integer, is a frame 


function if and only if either n=0 or n=2 (mod 4). 


Proof. In order that f be a frame function we must have, for some 
constant W, 


(38) cos nô +cos (a8 +5)) = W 


for all 0. Differentiating with respect to 0, we have: 


nin n@+sin n(@ +3) = 0 
for all 9, i.e., 
nf (sin no)(1 +cos m) +sin > cos no} =Q. 
This is possible if and only if either n=0 or cos(nz/2)=—1, i.e., if and 
only if n=0 or n=2 modulo 4. 


Lemma 4.13. Let #” = R? be the three dimensional real Euclidean space. 
Then every continuous frame function on the unit sphere & is regular. 


LOGICS ASSOCIATED WITH HILBERT SPACES 87 


Proof. The proof makes use of harmonic analysis on & (cf. Weyl [1], 
pp. 60-63, 142-146). Let & be the set of all real valued continuous functions 
on & and G the group of all orthogonal transformations of R? which have 
determinant +1. For any element h of @ and any a e G, we define h* by 


(39) h(x) = h(a +2) (zeg). 


A linear manifold MEg is said to be invariant if hte M for all aeG 
whenever h e M. Then, according to classical spherical harmonic analysis, 
one has the following description of ¢: (1) for each integer n=0, 1,- -. 
there is a 2n +1 dimensional subspace F, of € which is invariant and no 
proper linear submanifold of which is invariant, i.e., F, is irreducible 
under the action (39) of the group G; (2) an element h e € belongs to F, 
if and only if it is the restriction to & of a polynomial p of x, y, z (the 
coordinates on R°) with the properties 


(i) p is homogeneous in x, y, z of degree n, 
(40) 


(3) every continuous function on @ is the uniform limit of a sequence of 
functions each of which is a finite linear combination of elements of the 
F; and finally (4) if F is any invariant linear manifold, closed under the 


uniform convergence topology of G, and n4, na,- (OSN <No <- -) are 
all the integers such that F, , SF, j=1, 2,---, then 
(41) F = (2 Fu) > 

Í 


where > denotes the algebraic sum, and the bar denotes the closure. 

Let now F be the set of all continuous frame functions on &. Evidently 
G-transforms (cf. (39)) and uniform limits of continuous frame functions 
are continuous frame functions. Thus the set F is a closed linear manifold 
in @ which is invariant. Therefore, there exist integers n,, N2,+-- with 
the properties described in (41). We now claim that if n is any integer 
other than 0 and 2, then, F, F. In fact, let n be any integer. We use 
cylindrical coordinates p, 0, z on & defined by 


x = pcos ô, y=psind, z=z. 
Then it is well known that 


ee #8 SR 
ox? Oy? G2 3p p* 007 pôõp az? 
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One can easily check that the functions 
p” cos n9, [p" —2(n — 1)z?p"-?] cos (n—2)0 


are homogeneous polynomials of degree n which satisfy the equation 


Thus if F, EF, then p" cos nô and [p"—2(n—1)z?p"~?]-cos(n—2) 8 are 
frame functions. For any 8, (cos 6, sin 8, 0), (—sin 6, cos 6, 0) and (0,0,1) 
form a frame of R? so that the functions obtained by putting z=0 are 
frame functions on the unit circle of the xv, y plane. Thus cos nô and 
cos(n —2)@ are both frame functions. By lemma 4.12 this is not possible 
unless n=0 or 2. Thus we must have AF C(F_,+F.)” ie fF SF t F 
since F, and F, are finite dimensional. We now claim that F¥ = F,.+F, 
and that every element of 7.+¥¥, isa regular frame function. In fact, F 
consists of the constants which are regular. On the other hand, F3 
consists of the quadratic forms q(z,y,z) which satisfy 


a Z g 


1.€., 
qlx,y,z) = ax? + by? +c2? +2 fyz +2gzx + 2haxy 
with 
a+b+c=0. 


Thus a function of @ lies in Fa +F if and only if it is of the form 
x —> (AX,x» 


(x is a vector in £), where A is a symmetric matrix of order 3. In other 
words, every element of F.+.F, is a regular frame function. Thus 
F=F,+F,. This completes the proof that every continuous frame 
function is regular. 


We now take up the proof of the deep lemma that every nonnegative 
frame function on the unit sphere in real three-dimensional space is 
actually continuous. Gleason’s proof of this result is an intricate one based 
on spherical geometry. We shall first introduce some terminology. We 
utilize the usual latitude-longitude coordinates on a sphere. For any 
point p of the unit sphere & of R?, we define the northern hemisphere 
relative to p to be all the points whose latitude 6 relative to p as north pole 
satisfies 0 < 0 < 7/2. We shall denote this by N,. Given p, let g be some 
point of N,. Clearly, g#p. Among the great circles which pass through this 
point g there is one whose plane cuts the equator relative to p at points 


LOGICS ASSOCIATED WITH HILBERT SPACES 89 


which are orthogonal to q. If the latitude of q is 0, it can be shown that this 
great circle passes through only those points whose latitudes satisfy 
—4<p<9. It is therefore appropriate to call this the EW great circle 
through q. 

The essence of our argument is contained in the next lemma. To motivate 
this lemma, let us consider in the usual coordinates x, y, z the function 


(x, Y, 2) > 2? +y?. 


This is a frame function which is nonnegative and vanishes at the north 
pole p=(0,0,1). It is constant on points with the same latitude. For any 
point q of the northern hemisphere, the points of N, other than g on the 
EW great circle through q, have latitudes strictly less than the latitude 
of g, and consequently this frame function reaches its minimum on the 
EW great circle through q, at q. The next lemma asserts that every frame 
function which is nonnegative has this property approximately. 


Lemma 4.14. Let pe & and g be a nonnegative frame function which is 
constant on the equator opposite to p. Then for any q in N, we have: 


(42) 9(q) < g(x) +9(p) 

for every point x on C, the EW great circle through q. 
Proof. We first assert that if r is any point of V,, then 

(43) g(r) < k+9(p), 


where k is the constant value which g takes on the equator. To see this, 
consider the EW great circle through r. If 7’ is one of the points at which 
this meets the equator, then r_| r’ and hence r, r’ are two points of a possible 
frame. The frame function g being nonnegative, one has: 


g(r) +9(r") < W, 


where W is the weight of g. But if we take any two points a, a’ on the 
equator which are orthogonal, a, a’, p is a frame and, consequently, 


2k+g(p) = W. 
Thus 
g(r) < W—g(r’) < 2k+g(p)—k < k+9(p). 


Suppose now g is an arbitrary point in the northern hemisphere. The 
inequality just now established proves the lemma if v lies on the equator. 
For z below the equator, — x lies above the equator, and, since g(u)=g(— u) 
for all u, it is enough to prove the inequality (42) when x lies on the EW 
great circle C through q and is also situated in the northern hemisphere. 
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Let y be a point on C, orthogonal to x, and lying in the northern 
hemisphere. We then have, if g’ is a point where C meets the equator, 


g(x) +9(y) = 9(9) +9(9') = 9(g) +k 
and, since g(y) < k +g(p) by (43), one has: 


g(x) = g(q) +k—g(y) 
> 9(q)—-9(p), 
which establishes the lemma. 


Lemma 4.15. With p and g as in lemma 4.14, let z be a fixed point in N,. 
Let M, be the set of points x on N, which have the property that for some 
point y of N, the EW great circle through x contains y and the EW great 
circle through y contains z. Then the interior of M, is nonempty. Moreover, 
for any x of M,, one has: 


(44) g(x) < g(z)+29(p). 


Proof. Since g(x)<g(y)+g(p) and g(y)<g(z)+g9(p), the required in- 
equality follows for all the points x in question. It now remains to 
establish that M, contains a nonempty open subset of V,. We use Carte- 
sian coordinates. We can arrange matters so that the coordinates of p are 
(0,0,1), while the coordinates of z are (cos a, 0, sin «). We have 0<a<z7, 
a7#7/2. One can easily see that for any point y whose coordinates are 
(€,7,¢) (0<f<1) the EW great circle through y contains z if and only if 


(£? +x?) sin a— é% cosa = 0. 
Consider now the function 4, where 
Plé) = (£ +n?) sin a— £% cos a. 


This is greater than 0 on the equator. On the other hand, there are 
points of N, for which it is <0. Therefore, if we define A to be the set of 


points 
A = {(6.7,6):0<0<1, (E,n,0) < 0}, 


then A is a nonempty open subset of N,. Suppose x lies in A. Then 
(x) <0 while the EW great circle through x meets the equator at a point 
at which is positive. Since ¢% is continuous, there must be a point y on 
the EW great circle through x at which ¢ vanishes; thus x e M,. In other 
words, A < M,. This proves the lemma. 


Lemma 4.16. With p and g as in lemma 4.14, let y>0 be a positive number 
such that g(p) <n. Then there exists a point q of N, and an open set M around 
q such that 


(45) 0 < sup g(u)— inf glu) < 3y. 
ueM ueM 
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Proof. Define b=inf,.y, g(u). Then b20. Choose ze N, such that 
b<g(z)<b+y. Consider the set M,. By lemma 4.15, M, has a nonempty 
interior and for every point x of M, one has: 


g(x) < g(z)+2n, 
that is, 
b < g(x) < b4+3n. 


Let q be any point in the interior M of M,. Then M is an open neighbor- 
hood of q and 
0 < sup g(u)— inf g(u) < 3n. 
ueM u 


This proves the lemma. 


Lemma 4.17, Let f be any nonnegative frame function and r any point 
such that for some open neighborhood M of r we have: 


(46) 0 < sup f(u)— inf f(u) < «a. 
ueM ueM 
Then for every point s of E there is an open neighborhood M, of s such that 
(47) 0 < sup f(u)— inf f(u) < 4a. 
ueMs uEeM; 


Proof. Use coordinates so that r appears as the north pole. Let t be any 
point on the equator opposite r. Since M is an open neighborhood of r, 
we can assume that every point whose latitude is > (7/2)— 6, belongs to 
M, 9, being a suitable angle in ]0,7/2[. Consider now a point u due south 
of t and latitude — 40.. Then by continuity it is clear that there exists an 
open set M, around ¢ with the following property: if k is any point of M,, 
then, on the great circle through u and k, there are points orthogonal to u 
and k, respectively, which are situated in M. Let k, k’ be two points of M,, 
and C, C’ the respective great circles through u, k and u, k’. Let x, y and 
x', y' be pairs of points, respectively, on these two great circles such that 


z, y, x, y EM, 
elu ylk, glu y Lk’. 
f(z) +f(u) = f(k) +f(y), 
fle) +S u) = fk) +f y’). 
Subtracting and rearranging, we have: 


|F- s |f@)—-fe)| +If(y-f(y)| 


< 2a, 


We then have: 


so that 
0 < sup f(u)— inf f(u) < 2a. 


ueM, ueM; 
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If sis any point of &, there exists a point t of € such that t is on the equator 
opposite r and s is on the equator opposite t. Applying the preceding 
argument twice, we see that there is an open set M, around s such that 


0 < sup f(u)- inf flu) < 4a. 


This proves the lemma. 


Lemma 4.18. Let f be a nonnegative frame function on &. Then f is 
continuous. 


Proof. Let «>0 be a given positive number. We shall prove that every 
point of & has a neighborhood in which the oscillation of f is at most e. 
Since a constant is a continuous frame function, we can, by subtracting 
the infimum of f (over £) from f, assume without any loss of generality 
that 

inf f(u) = 0. 


ue’ 


Let n>0 and p a point such that f(p)<n/2. Let o be the rotation around 
the z-axis with p as north pole through an angle 7/2. Let the frame function 
g be defined by 


g(x) = f(x) +f (ox). 
g is nonnegative, and at p we have: 
g(p) < 7; 


moreover, g is constant on the equator. By lemma 4.16 there exists a 
point q of the northern hemisphere and some neighborhood M, of q 
such that 
0 < sup g(u)— inf g(u) < 3y. 
ueMq 


ueMg 


By lemma 4.17 there exists an open set M, around p such that 
0 < sup g(u)— inf g(u) < 12y. 
ueMy ueMy 


Since g(p)<7, we have: 
0 < sup g(u) < 13». 
ueMp 


Since 0<f(u)<g(u) for all u, we have: 


and hence we have: 
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Applying lemma 4.17 again, we see that every point of & has a neigh- 
borhood over which the oscillation of f is at most 52y. If we choose 
n = 2/52, we are through. 


Lemma 4.19, Let # be a real separable Hilbert space of dimension at 
least 3. Then every nonnegative frame function f on the unit sphere & of H 
ws regular. 


Proof. Since every two-dimensional subspace L of # can be imbedded 
in a three-dimensional subspace, it follows from lemma 4.18 that the 
restriction of f to the unit sphere (L) of L is regular, and consequently 
there is a unique symmetric bilinear form on Lx L, say B,, such that 


(48) Byi(u,u) = f(u) 


for all u e &(L). We shall now define a real valued function B on # x # 
as follows. Let x and y be two vectors of # and L a two-dimensional 
subspace containing x and y. We define 


(49) B(z,y) = B,(x,y). 


Clearly, B(x,y) is well defined if x and y are linearly independent, since 
in this case L is unique; or if one or both of them is zero, since we then 
have B,(x,y)=0 for any such L. If x and y were dependent and neither 
is zero, then they span a one-dimensional subspace. If L, and L, are two 
two-dimensional subspaces containing x and y, then both L, and L, are 
contained in a three-dimensional subspace, say M. Let C be a symmetric 
bilinear form on M x M such that 


C(a,a) = f(a) 


for all vectors a in &(M). The restrictions of C to L, x L, and Lx Le 
are symmetric bilinear forms whose quadratic forms coincide with f on 
the unit spheres of L, and Lg, respectively. Since B,, and Bz, also have 
this property, we conclude that 


B,, (x,y) = C(x,y), 

Bi, (x,y) = C(x,y), 
which proves that B(x,y) is well defined. Moreover, we have 
(50) Bix) = f(x) 


for all unit vectors x in #. We now claim that B is bilinear. The homo- 
geneity of B as well as its symmetry involves only two vectors and so 
follows from the homogeneity and symmetry of B,. To complete the proof 
of the bilinearity (in view of the symmetry of B), it is enough to show that 


B(x, y +2) = B(x,y) + B(z,2) 
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for any three vectors x, y, z. Let M be a three-dimensional subspace 
containing x, y and z. Let Lı, Lz, L3 be two-dimensional subspaces of M 
containing, respectively, x, y; x, z; and x, y+z. Let C be a symmetric 
bilinear form on M x M such that C(u,u)=f(u) for all unit vectors u in M. 
Then we see at once that 


C(x,y) = B, (ay); C(x,2) = B,, (x,y) 


C(x, y +z) = B,, (2, y +2). 


and 


Since C is bilinear, we have: 


B(x, y +z) = By, (x, y +2) 


= C(x, y +2) 
= C(x,y) +C(2,z) 
= B, (x,y) + Bz, (#2) 
= B(x,y) + B(z,2), 
which proves the bilinearity of B. 
Since 
0 < B(x) < 1 


for all x with |x| =1, x, y—> B(x,y) is a bounded bilinear form. Thus there 
exists a bounded, self-adjoint operator U such that for all x, y in # 


Bix,y) = (Uxy), 
f(x) = (Ux,x). 
Thus we have proved that f is regular. 

We now proceed to the case of Hilbert spaces over D, where D may 
either be the complex number field or the quaternion division ring. Let 
H be a Hilbert space over D. A closed R-linear manifold S of # is called 
completely real if the inner product ¢., .> in # x # takes real values on 
S x S; equivalently, if and only if there is an orthonormal set {e,} such that 
S is the closure of the real linear combinations of the e,. 

Lemma 4.20. Let Z be a real separable Hilbert space, and fọ a regular 
nonnegative frame function of weight W. Then for all a,beEB with 
Jal = [4 =1, 

(51) |fo(a) —fo(b)| < 2W||a—o]. 

Proof. There is an operator T, of trace class and self-adjoint such that 

<T'c,c> =f,(c) for all unit vectors c. Then 


| fo(@) —fo(5)| = |< Ta,a>—<Tb,b>| 
= |(T(a—b), (a+b))| 
< 2) 7") a-b]: 
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But 0<<T¢c,c> < W for all unit vectors c so that |7'|| < W. Thus 


Ifo(4)—fo(d)| < 2W|a—b]. 


Lemma 4.21. Let # be a Hilbert space over D of finite dimension n. If f 
is a nonnegative frame function which is regular on each completely real 
subspace, then f is regular. 


Proof. We prove this by induction on n. For n=1, the lemma is trivial. 
Let n>1. Let d=sup,,,-;f(u) and {x,} a sequence of points on @, the 
unit sphere of #, such that f(z,) ->d. We may assume, in view of the 
compactness of the unit sphere, that x, — y, ||y||=1. We first show that 
f(y)=d. Let A,=|<y,7,>|~1<y,2,>. Since 2,—y, A,—>1. The inner 
product <A,2,,y> is real and consequently A,z, and y span, over R, a 
completely real subspace. We now have, if W is the weight of f, 


f(y) —d| Ss f(y) —f(An%n)| + [S (Ann) — d| 
< 2W|y— Antl + |f (£n) —dl, 


since f, by hypothesis, is regular when restricted to the completely real 
subspace spanned by A,2,, y, and f(%,)=f(An%,) (|A,|=1). This shows 
that f(y)=d. 

Let us now extend the function f to a function F defined on all of # 
by setting 


0 v = 0. 
leles del- w), v0 


Clearly for c e D, 

(53) F(cv) = |ce| F (v). 

We now assert that if u is any vector of # orthogonal to y, then 
(54) F(cy +u) = |c]?d + F(u) 


for all c e D. Let L be the two-dimensional real linear manifold spanned by 
y and u. This is completely real. fis regular on its unit sphere so that f is the 
restriction to L of a nonnegative definite quadratic form Q. But y is the 
point at which the maximum of the quadratic form is reached and u is 
orthogonal to y in .L. Thus the matrix of the quadratic form Q is diagonal 
with respect to the basis {y,u} of L, and consequently we have, whenever 
r and r’ are two real numbers, 


F(ry+r'u) = |rl?d + |r'|? F(u). 


It is to be noted that u here is an arbitrary vector orthogonal to y. 
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Suppose now that v is any vector orthogonal to y and c#0 an element 
from D, not necessarily real. We have: 


F(cy-+v) = F(e(y +72») 
= |cl?F(y+c71v) 
= |cPF(y+|e|~*|cle*), 


but u =|c|c7 1w is a vector orthogonal to y. Thus 


F(y+e74v) = d+ |c|~2.F(|cle~4v) 
= d+|e|~?F(2), 
proving that 
F(cy +v) = |cl?d+F(v). 

The restriction of f to the orthogonal complement % of y is a frame 
function, which, on 4%, has the same properties as f. Suppose we assume 
the present lemma to be true for all Hilbert spaces of dimension k <n. 
Then F restricted to Æ is regular by the induction hypothesis and hence 
there is an orthonormal basis {e,,---,¢é,_,} for X and real numbers 
d,,°++,4,-1,20 such that 


F(cyey +++ + +6,-1€,-1) = |c,|?d, + ve +|c,~1|7d,-4 


for all c,,---,¢C,-,;€D. By the proof given in the previous paragraph, 
this means that 


P(cy+0,€, +++ +C,—1@n—1) = |e|?d leidi +--+ H len- dna, 
which shows that f is the restriction to the unit sphere of the regular 


frame function fy, where U is the operator defined by Uy=dy, Ue,=d,e,, 
j=l1,2,---,n—1. This proves the lemma. 


Lemma 4.22. Let # be a separable Hilbert space over D which is one of 
R, C, or H of dimension at least 3 and f a nonnegative frame function on the 
unit sphere E of X. Then f is regular. 


Proof. Since the dimension of # is at least 3, it follows that the re- 
striction of f to any completely real subspace is regular. Only the case 
dim # = œ remains. Lemma 4.21 is applicable to any finite dimensional 
subspace of #. Thus for any finite dimensional subspace S of # there 
is an operator Us which is an endomorphism of S, bounded, self-adjoint, 
and >0 such that 


f(z) = (Usx,a> (xes, |æl = 1). 


The uniqueness of U șş implies that if S1 S S2, (Us 2,y)=(Us,2,y) if x,y E S. 
This implies in the usual way that there exists a symmetric semi-bilinear 
form on # x X, say, B such that 


B(x,x) = f(x) 
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for all x with |x| =1. Since 0< B(z,x) <1, there exists a bounded, non- 
negative, self-adjoint operator U such that 


(Ux,2) = B(x,x) = f(x). 
This proves the lemma. 


Theorem 4.23 (Gleason [1]). Let W be the convex set of all von Neumann 
operators of unit trace on a separable Hilbert space # of dimension at least 
3 (possibly infinite) over D. Then the map 


(55) U —> py (UEW) 


(where py is defined by (30)) is a convext isomorphism of W onto the convex 
set F of all states of the logic L. A state pe S is pure if and only if p=p, 
for a unit vector u of #, p, being defined as in (25). p, =p, uf and only if 
for some c e D with |c|=1, v=cu. In particular, the pure states of L are in 
natural one-one correspondence with the rays of X. 


Proof. That U —> py is an isomorphism of W onto S is immediate 
from lemma 4.22. Next we shall obtain a description of the pure states 
of Z. For any vector u e # with |u] =1, let p, be the state of & defined 
by 

p,(M) = || P™ul? (Me). 
We assert that p, is a pure state. Let U, be the projection on the one- 
dimensional space D-u. Clearly p,(J/)=tr(P”U,) so that 


Pu = Pup: 
To prove that p,„ is pure, let »,, pa be states of L such that 
(56) Py = 4p, +(L—a)po, 


where 0<a<1. Since U —> Py is a one-one convex map of W onto the 
set of all states, there are (unique) elements U,, U, E W such that p,=py, 
(j=1,2) and hence 


(57) Uo = aU, +(1l—a)U}. 


Since (U,z,2>2>0 for all xe Æ, j=0, 1, 2, and since (Uoz,x>=0 if x is 
orthogonal to u, we conclude from (57) that for x| u, (U,7,2)> =0 (j=1, 2). 
Now U, is >20 and hence there is a bounded self-adjoint V, such that 
V2=U;, and we have <U xx) = || V ;æx||?. This implies that V,x=0 for all 
x orthogonal to u and, consequently, that U ,x=0 for all x orthogonal to u. 
Since U, is self-adjoint, this implies that U,=c,U, for some constant 


+ If A and B are convex subsets of two vector spaces over the real number field 
and if wis a map of A into B, we say that u is convex if u (ax + by) =au(x) + bu(y) for 
z,y E€ Á, 0<a,b<1 anda+b=1. 
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¢,(j=1, 2), and c;=1 as tr(U;)=1. Therefore U,=U,=U,, which proves 
that p, is a pure state. Conversely let Uj € W be such that py, is a pure 


state. If a,,-+-,a,,:-- (20, a,+a,.+ ---=1) are the eigenvalues of Us, 
and Ui, Uz, ***, Un ** + Corresponding eigenvectors of norm 1, then 
(58) Pu, = > Gn Pun’ 

n 


As py, is pure, (58) implies that all but one of the a, must be 1, the rest 
being 0. This shows that py, =p,, for some j. 


Remark. From theorem 4.23 we can obtain at once an important charac- 
terization of the principle of superposition of pure states. Let {u,} be a 
family of unit vectors of # and let us write p, for p,,. Let p=p, be a 
pure state and let M be the smallest closed linear manifold containing all 
the u,. If p is a superposition of the collection {p,}, then p(M+)=0 as 
p,{(M+)=0 for all æ. Hence u e M. On the other hand, if u e M, then for 
any Ne having the property that p,(N)=0 for all «, p(N)=0, since 
any such UN is contained in M+. In other words, p, is a superposition of the 
family {p,,} if and only if u lies in the smallest closed linear manifold of 
H# containing all the u,. Thus, if we make the assumption that the logic 
of the physical system © is standard, the pure states can be identified with 
the points of an infinite dimensional projective geometry with the principle of 
superposition translating into the usual concept of linear dependence of points. 

Our next object is to discuss the nature of the statistics of one or more 
observables associated with the standard logic #. We begin with the 
following theorem: 


Theorem 4.24. Let x(E —>2x(E)) be an observable associated with L, U 
a von Neumann operator on X of trace 1 and py, the state M —> tr( PMU) 
of L. Let A, be the self-adjoint operator on # which corresponds to x. If f 
is any real Borel function bounded on the spectrum of x, fo x is a bounded 
observable and 


(59) E(fox|py) = tr(f(Az)U). 


If x is nonnegative, sois A x and then &(x| py) exists if and only if B = A}? U1? 
is an everywhere defined, bounded operator of Hilbert-Schmidt class; in this 
case 


(60) &(x| py) = tr(B* B). 


If x is arbitrary, then x has finite variance in the state py tf and only if 
A,,U"!? is an everywhere defined, bounded operator of Hilbert-Schmidt class. 
In this case, (60) reduces to 


(61) E (x| py) = tr(U1/274,U"?). 
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In particular, if u is a unit vector in X and p, is the state M —> || Pul? 
of L, x has a variance in the state p, if and only if u e D(A,);t in this case, 


(62) E (x| py) = (A,U,U»>. 


Proof. If S is the spectrum of x and K =(f[S])~, the closure of f[S], then 
(f° x)(K)=#; and as K is compact, this proves that y =f o x is bounded. 
The spectral measure of the operator A, is the measure E —> Q;- 1g, where 
Q is the spectral measure of A,. Thus A,=f(A,) in the usual sense of the 
functional calculus of self-adjoint operator on #. This proves (59) at once. 

We next take up the more delicate questions involving possibly un- 
bounded x. Let x be any nonnegative observable i.e., 2x([0,col)=#, 
associated with X. Then A, is a nonnegative, self-adjoint operator. We 
denote by Aj’? the unique, nonnegative, self-adjoint operator C such that 
C*=A,. Let U be any von Neumann operator of unit trace on # such 
that &(z|py)< oo. Let Q(E + Qr) be the spectral measure of A, and 
k(£)=tr(Q,U). Clearly k({0,co[)=1 and 


E (x| py) = f tdk(t). 


If pe# is any unit vector, (Q,U1/%9,U1"o) < tr(U!?Q;:UY?)=k(E); 
hence, on writing y= U1/29, 


Í ” (22dk (t) < 00, 
0 


where k, is the measure E —> (Q;¥,%>. This implies that ẹ lies in the 
domain of Al/?. In other words, A1/?U1/? is everywhere defined. Since 
A}!? is closed and U*? is bounded, it is easily seen that A1/?U1/2 is closed. 
Consequently, B= A}1/2U}/? is a bounded operator. To prove that B is of 
Hilbert-Schmidt class it is enough to prove that >, || Bp,||? <œ for some 
orthonormal basis of #. Let {p,} be an orthonormal basis of # such that 
Uo, =n, for all n; then a 20, >,a,=1. Let k,(#)=(Q:9,,9,>. Then 
each k, is a probability measure and k=}, a,k,. But 


> || Be, ||? = > a, || A 2/2, || 
= Ja, f (E ae 


= Jan | taent) 
= [, ae 
< 0 


t 2(L) is the domain of the linear transformation L. 
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For the converse, if B=A1/?U1/2 is defined everywhere and is a bounded 
Hilbert-Schmidt operator, we have, using the above notation, 


tr(B*B) = > | Be,|? = >a, f tdk,(t) < 00. 

n n 0 

But then, by Fatou’s lemma, 
Í idk(t) = &(a\py) < 0. 

0 

At the same time, the above argument proves that 
8 (| py) = tr(B*B). 

Next we consider the case of finite variance. We use the same notation. 
x has a finite variance in the state py if and only if i t?dk(t) <œ. Pro- 
ceeding exactly as before, we conclude that if this condition is satisfied, 
D=A,,U'!? is an everywhere defined operator of Hilbert-Schmidt class. 
Note that, if a,>0, op, =a, P UY?p, so that pa E 2(4,). In this case, 
CLAPP = f * tdk,(t) by standard spectral theory. Now, U™?D is a 


bounded operator and, being symmetric, is self-adjoint. As U'/? is also 
Hilbert-Schmidt, U1/*.D is of trace class. We then obtain 


tr(U"?D) = > (A,U"9,,U"9,> 


= > An<A Pn, Pn? 


This proves (61). For proving the converse, let A UY? = D be everywhere 
defined and of Hilbert-Schmidt class. We want to prove that 


f j t?dk(t) < oo. 


Write, for any Borel set EC R}, 
k,,(£) = CU? Pn, U129,). 


Then 
k(E) = tr(U*?Q,U"”) 
= > (2) 
and " 
> | Pak) = J 14.0" 94 
< o. 


It follows once again by Fatou’s lemma that Í Pa t?dk(t) < 00. 
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For the last part, let U be the projection on the one-dimensional space 
spanned by p. Then U1!/?=U and A,U*? is defined everywhere if and 
only if p e D(A,,); in this case, it is easily seen that A, ,U™? has at most 
one-dimensional range and tr(U*/#A,U*!*) =¢(A,9,> as ||p||=1. Theorem 
4.24 is completely proved. 


We next come to the analysis of the joint statistics of several observ- 
ables. For simplicity we deal only with bounded observables. Let x, 
(1<j<k) be bounded observables associated with Z. Let A, be the 
bounded self-adjoint operator which corresponds to 2,. If c4, C2,--+,¢, ED 
are real numbers, and c=(c,---,¢,), A(e)=c,A,+---+¢,A; is also 
a bounded self-adjoint operator. Let E —> Q(c; E) be the spectral measure 
of A(c). If pe & is a unit vector, and if we define g,(c; E) by 


(63) qole; E) = Qc; £)e|?, 
q,(c;°) is a probability measure on Z(R?). We say that the observables 
{%1,+°-,%} have joint statistical distributions if for each unit vector 


pE, there exists a probability measure g, on @(R*) such that for all 
vectors c = (¢,,°°*,¢,) of R* and all He A(R), 


(64) 9o(L(e)) = qo(e; E), 
where 
(65) E(e) = {(t1,--+-, be) © Gti +--+ +e,b, E E}. 


The sets H(ec) are the “‘half-spaces” in R*. Since any D-valued measure 
on &(R*) is uniquely determined by its values on the sets of the form 
E(c), it follows that q,, if it exists at all, is unique. Obviously, the above 
sense is a quite conservative one in which we may speak of the statistics 
of the observables {z,,---,2%,}; the probability measure q, will then be 
the joint distribution of {x,,---, zy} under the state of Y determined by 
om. We shall now prove the following theorem (Varadarajan [1]). 


Theorem 4.25. With the notations described above, {x,,---, xy} have joint 
statistical distributions if and only if the operators A,(1<j<k) commute 
with one another. 


Proof. If the A; commute with one another, the observables x, are 
simultaneously observable and hence there exists, by theorem 3.17, a 
o-homomorphism £ of 4(R*) into Z such that 


(66) B(n,-*(E)) = zE) | 

for all Ee Z(R?) andj=1,2,---, k, m; being the projection (t, °°° , ta) > & 
of R* into R!. If we define for a unit vector pe 3, and E e2(R?), 

(67) qo(H) = | PPE], 

then (63) and (64) are satisfied. Thus {x,, ---, x,} possess joint distributions. 
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We now proceed to prove the converse. If {x,,---,2,} possess joint 
distributions, it is obvious that for 1,7 <k, 14), {xx} have joint distri- 
butions. We may (and do) therefore assume that k=2 and prove that 
A,A,=A,A,. For any pe # which is nonzero, we write p=(|\p||)~*p 
and define q,=(|p|’¢y°qp is a nonnegative measure on #(R*) and 
Qp(R?) =|||?. Set qg=0. We now define the D-valued measure gy, g» for 
an arbitrary pair of vectors 9, ¢’, as follows: 


lo, œ = 3(do+0'—Jo-o') (D = R), 
(68) do,o = 4(Yo+0—Yo-o +19-10+0'— 19-10-07)  (D= 0), 


3 
lo, = i(tor0—do-0'+ > -nose —1-ro-o)i) (D = Q). 
r=1 


We assert that for each M e Z(R?), the map 9, 9’ > qy, y (M) is bounded 
and -bilinear on # x #. To verify this, let 9, po, p €# and let us 
write 


v( M) = lo +03,0 (M)— Io, ,o(M)—Go,,0(M); 


then v is a D-valued measure on &(R*). However, if M =H(e), where 
EeZ(R!), then q(M)=(Q(c; E)p,p) and hence from (68) we infer that 


Vo,o(L(c)) = <le; Epp). 
Consequently, v(M)=0 whenever M = E(e). Hence v=0. Similar arguments 
complete the proof that p, p’ > qe (M) is *-bilinear. Since 


0 < qo,0(M) < |pll?, 
it is clear that p, ¢'—> qo, (M) is a bounded form. We can now use a 
well known theorem to construct a unique bounded self-adjoint operator 
Py such that 

Yo,0'(M) = (Pupp). 
Clearly, 0<Py<I for all M, P,=0, and P,2z=J. Moreover, as 
M — qo,o(M) is a measure, Py is additive over disjoint sets M, i.e., if 
M,, M,,°+: are pairwise disjoint Borel sets of R? and M=\), M 


(Pupp) = > Pupp). 
n=1 
We note also, as go,.(M)>0, that for M €M, Py, < Pm,- Finally, if 


M =K(c), Pu=Q(c; E). To complete the proof we need another lemma. 


Lemma 4.26. Let B be a nonnegative self-adjoint operator and S, T two 
projections such that B<S and B<T. Then BEAT. 


Proof. Let X and Y be the ranges of S and T, respectively, and let 
C = B"?, Since |Cx||? < ||Sx]|?, C, and hence B, vanishes on X+. Similarly, 
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B vanishes on Y+ and hence B=0 on (X A Y)}. B, being self-adjoint, 
therefore leaves X N Y invariant and for re X N Y, ¢(Bu,x)< |||”. 
Thus B< P*°=SAT. 

We are now in a position to complete the argument. We must show that 
for any two sets H,, E, in @(R4), Q((1,0); #,) and Q((0,1); #,) commute. 
Write Q,;=Q((1,0); #,) and Q,=@Q((0,1); #,). Since £, x E, = E x R}, we 
have 

Pexe, < Pr, xr = Qi 
Similarly, 
Pr, xe, < Qo 


From lemma 4.26 we may conclude that 
Pr, xea S Qi A Qo 
Replacing E, by R!— E, we obtain 
Pr, xr -e, SQ A (1—Q)). 
The last two relations yield, on addition, 


(69) Qı < Qi A G24, A (I—Q2). 


However, Q, AQ, and Q; A (I —Q2) are orthogonal projections which are 
both <@,. Hence we infer from (69) that 


Qi = Qi A Q+, A (I—Q2). 


This last equation leads quite simply to the conclusion that Q, and Q» 
commute (cf. lemma 3.7). 

This theorem, taken together with theorem 3.9 may be regarded as 
giving a complete description of the circumstances under which the 
statistics of several observables may be regarded as arising from observables 
with joint distributions. 

We end these remarks with a mention of the concept of the quasi- 
probability distributions, introduced by Wigner [2] and studied extensively 
by Moyal [1]. Let x,, x2,---, x, be bounded observables with A,,---, Ák 
as their corresponding operators. We use the same notation as in theorem 
4.25. For fixed » in #, and for a given c, E — Q(c; #)=q,(c; E) is a non- 
negative measure on Z(R?). We define 


(70) 


1 
D(C Ca," * 5 Cri P) = nyn h exp {( —1)!?(cit +++ ++¢,,)}dq,(e; -). 


It is easy to prove that ®(-; pọ) is a continuous function of c,,---, c, for 
each p E and 
| D(C, erty Ck; p)| < 1l. 
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We may therefore introduce its Fourier transform, in the sense of distri- 
butions of Schwartz, say ®(-;). It is known as the Wigner-Moyal 
qguasi-probability distribution of the observables x,,---, 2, in the state 
determined by 9. 


3. THE STANDARD LOGICS: SYMMETRIES 


At the focal point of the axiomatic development of modern physics 
lies the concept of automorphism or symmetry. We have already pointed 
out that the momentum observables of a physical system are inti- 
mately related to one-parameter groups of automorphisms of the con- 
figuration manifold of the system. Moreover, we had described in Chapter 
III how the dynamical group of a physical system can be described by 
a one-parameter group of convex automorphisms of the set of states. It is 
therefore extremely important to examine the concept of an automorphism 
when the logic in question is a standard one. The main purpose of this 
section is to obtain a complete description of the automorphisms associ- 
ated with a standard logic. 

We shall introduce, for this purpose, the notion of a symmetry of a 
Hilbert space #. As always, # is separable and the field of scalars D is 
one of R, C, or H. A mapping 7(z>Tx) of # into itself is said to be a 
symmetry (of #) if (i) T is additive, one-one, and maps # onto itself, 
and (ii) there exists a continuous automorphism 8 of D such that T is 
6-linear and, for all x, ye #, 


(71) <Tx,Ty> = <a,y>®, 


If D=R, then @ is the identity and the symmetries are none other than 
the unitary operators of #. If D=C, then @ is either the identity or the 
conjugation; in the former case, T is unitary, whereas in the latter case T 
is anti-unitary. If D=H, then there exists a ceD such that |c|=1 and 
d®=cdc~} for all d e D; in this case, 


T (dx) = (cdc~*)Tx 
and 
(Tx, Ty) = ca,ye-*. 
Equation (71) implies that 
[Tz] = [al 
KTz,Ty>| = |<x,y>| 


In the general case of (71) we shall call T 6-unttary. The symmetries of 
H form a group; if T, is 0,;-unitary(7=1,2), 7,7. is 0,6,-unitary and 7',~ ‘is 
6,->-unitary. The unitary (linear) operators of # form a normal subgroup 


(x, y E HR). 
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U(X) of the group of all symmetries. We shall write S(#) for the group 
of symmetries. The map which associates with any 6-unitary T, the auto- 
morphism 0 of D, is a homomorphism of S(#) onto the group Aut(D) of 
all continuous automorphisms of D. Since the kernel of this map is (Æ), 
we have: 


(72) SIUH) ~ Aut(D). 


Note finally that if T is a symmetry and c e D is such that |c|=1, cT is a 
symmetry. 


Theorem 4.27. Let 4 be a separable infinite dimensional Hilbert space 
over D. Let «(M —> M*) be an automorphism of the logic L. Then, there 
exists a symmetry T of Æ such that 


(73) M“ = T[M] (MeL). 


If T’ is a symmetry such that (73) is satisfied for all M, then there exists a 
ce D with |c|=1 such that T’=cT. Finally if T is any symmetry of X, 
M —> T({M] is an automorphism of the logic L. 


Proof. It is quite trivial to check that if T is a symmetry of Z, 
M —> T[ M] is an automorphism of the logic 2. 

Suppose now that «(M — M") is an automorphism of the logic Z. Let 
F be a linear manifold, chosen once for all, such that dim F=3. By 
theorem 2.1 there exists an automorphism 0 of D and a 6-linear iso- 
morphism Lp, of F onto F”, such that for any linear manifold Mc F, 


M“ = LM]. 


Let us define F to be the collection of all finite dimensional linear mani- 
folds G which contain F. We claim that for each Qe IFZ there exists 
a unique -linear isomorphism Lg of G onto G* such that 


(74) Lou = Lyu (ueF) 
and 
(75) Me = L[M] (M = GQ). 


In fact, by theorem 2.1 there exists an automorphism 6’ of D and a 
6’-linear isomorphism L’ of G onto G* such that M“ = L'[M] for all linear 
manifolds McG. Since M*=L'[M]=L,{M] for all linear manifolds 
MeF, we conclude from the same theorem that there exists a nonzero 
céD such that (i) Z’u=cL,u for all we F, and (ii) a” =ca°c™} for all 
acD. If we write Lg=c™1L’, then Lg is a @-linear isomorphism of G 
onto G° satisfying (74) and (75). The uniqueness of Lg follows from (74) 
and (75). 
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Since Lg is uniquely determined by (74) and (75), it follows quite easily 
that there exists a 6-linear map L of # into itself such that 


(76) Lu = Lgu (wEeG, GEF). 


We claim that L is one-one and maps # onto itself. 

To prove that L is one-one, let re # and Lx=0. Evidently, x eG 
for some Ge F and Lx= Lox =0. Therefore x=0 as Lg is an isomorphism. 
If ye # there is a GeF such that y € G* and once again, as Lg maps 
G onto G*, there will exist an x e G such that Lr=y. Since any Me L 
with finite dimension is contained in some element of F, we see that L 
is a ĝ-linear isomorphism of # onto itself such that 


(77) M* = L[M] (dim M < œ). 


Let now GeF. Since « preserves orthogonal complementation, it 
follows that for x, y e G, <x,y>=0 if and only if <Lz,Lyà=0. Thus, the 
semi-bilinear forms x, y > <x, yy and x, y > < Læ, Lyt- Y of Gx G induce 
the same polarity of the geometry of subspaces of G, and hence there exists, 
by virtue of theorem 2.3, a unique dg #0 in D such that 


(Lx, Ly = <ayy*dg (x, ye G). 


Since G e F is arbitrary and since dg is uniquely determined for each G, 
it follows that dg is independent of G and that 


(78) <Lx, Ly» = <x, yy’d (x, YE H) 


for a nonzero d e D. 

We now show that 0 is continuous. This is obvious for D=R or H, 
since @ is the identity in the former case and is an inner automorphism 
in the latter. Consider now the case D = C. If 0 is not continuous, there 
exists a sequence {c,} in D such that c, —> 0, but |c,°| —> œ. Choose an 
orthonormal infinite sequence {e, : n=1, 2,---} in # and let z,=ne,. By 
replacing c, by a subsequence if necessary, we may assume that 


(79) || Lx,|/|c,9]| >0 (n—> o). 
Write 


x 
c 
z=) e,. 
a n 


Since c,* — 0, z is well defined in #. Clearly, z#0. Then, <x,,z)=c, and 
hence, for all n, u,=<z,z>~1z2—c,~ 1x, is orthogonal to z. Consequently, 
from (78) we obtain, for all n, 


(Lu, L2 = 0. 
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This implies, on simplification, that 
(80) (<2,2)°)~*< Lz, Lz) = (Cn) Larn Lz) 
for all n. But 
(en?) *| |< Lan, Lz] < |en?|-*| Lar, Lel, 


which tends to zero by virtue of our assumption (79). Hence (80) implies 
that <Lz,£z>=0 or z=0. This contradiction shows that @ is continuous. 

We are now ready to complete the proof of theorem 4.27. Since @ is 
continuous, it follows that for any real number r, r?° =r. Hence we see, by 
putting x=y in (78), that d is a positive real number. Write T =d~-}/?Z, 
Then T is a 6-linear isomorphism of # onto itself such that 


(81) (Tx, Ty) = ay)” 


and 
M* = T[M] (Me Y,dim M < œ). 


Since |Txl = |x|, T is an isometry and is hence continuous. It follows 
therefore that 
M* = T{[M] (Me). 


The remaining statements are easy consequences of theorem 2.3. 
Therefore, we omit their easy verifications. The proof of theorem 4.27 is 
complete. 


We shall say that T induces « if (71) is satisfied. 


Corollary 4.28. If D=R or H, then any automorphism a of L is induced 
by a unitary operator of #. If D=C, then «a? is induced by a unitary operator. 


Proof. For D = R there is nothing to prove. If D =H and T; is a 0-unitary 
operator inducing a where @ is the automorphism 


d —> cde~}(\c| = 1), 


T'=c~*T is a unitary operator inducing «. If D =C and « is induced by a 
*-unitary 7',, T? is unitary and induces «?. 


In the physical literature it is customary to prove the above theorem in 
a somewhat different formulation (cf. Wigner [3]). To motivate Wigner’s 
formulation we must introduce the notion of transition probabilities. Let 
r, r' be rays in # and ọ, o’ unit vectors in r and r’, respectively. Then 
|<~,p’>|? evidently depends only on r and r’ and not on p and go’. We 
denote it by [r,r’]. It is called the transition probability between the 
states p and p’ which correspond (cf. theorem 4.23) to the rays r and r’. 
If a physical system © is in the state p, [r,r'] is the probability that it will 
be actually found in the state p’ when an experiment is performed to 
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decide whether it is in the state p’ or not. We shall define a physical 
symmetry associated with the logic Z to be any one-one map a(r —> r*) of 
the set of all rays of # onto itself with the property that 


(82) [r,r] = [r,r] 


for all rays r, r’. The following theorem describes the natural connection 
between physical symmetries and symmetries of #. 


Theorem 4.29. (Wigner [3]). Let # be a separable infinite dimensional 
Hilbert space over D and T a symmetry of HX. Then r —> Tir] is a physical 
symmetry associated with L. Conversely, let a(r —> r“) be any one-one map 
of the set & of all the rays of # onto itself with the property that r_\_r’ if 
and only if r*_|_r'*. Then, «a is a physical symmetry associated with L, 
and there exists a symmetry T of H such that r*=T{[r] for allre@. T is 
determined by « up to multiplication by a number c e D with |c| =1. 


Proof. If T is a symmetry of X, |(T'z,Ty>| =|<x,y>| for all z, ye X. 
From this we conclude easily that r— Tir] is a physical symmetry 
associated with Z. We now examine the converse. Let a(r —> r“) be any 


one-one map of 2 onto itself which preserves orthogonality. For any 
subset A of & let 


(83) A% = {r* : re A}, 

(84) [A] = {x:xeX, xer for somer in 4}, 
and 

(85) At={r:r | r forall r’e A}. 


We shall first prove that if AS2 is such that M =[4] is a closed linear 
manifold of #, i.e., if A is the set of all rays of a closed linear manifold 
M of #, then [A“] is also a closed linear manifold. To prove this, we note 
that as M =[A] is a closed linear manifold, a ray r belongs to A if and only 
if re(At+)+. But as a preserves orthogonality, this means that a ray 
r'e A if and only if r’ e(4+°)+, i.e., A*=(A+*)+. But for any subset 
N EZ, the set [S+] is a closed linear manifold. Consequently [4°] is a closed 
linear manifold. . 

We now set up a correspondence, also denoted by «, between elements 
of Z. For any M e F, let A be the set of all rays r such that r € M; we set: 


(86) M* = [4°]. 


The argument of the preceding paragraph has shown that M*e LY. By 
very definition it is clear that «(M —> M“) is an order-preserving map of 
£. Since ra M+ if and only if r is orthogonal to all rays contained in M, 
it follows that (J/+)*=(M*)'. Thus « preserves orthogonality. These 
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results applied to the map «`! of Z show that the map M -> M@"» 
defined by (86) is the inverse of the map M —> M“ of L. Consequently, 
we may conclude that M —> M* is a lattice automorphism of £ which 
preserves orthogonality. 

By theorem 4.27 there exists a symmetry T of # such that M*=T[M] 
for all M e L. But then r*=7{[r] for all rays r showing that r —> r“ is a 
physical symmetry associated with Z. Since any automorphism of Z 
is determined by its restriction to the set of all one-dimensional subspaces 
of #, it is clear that T is determined by the map « of Z up to multipli- 
cation by a number ce D with |c|=1. This completes the proof of the 
theorem. 


In Chapter III we introduced the concept of a convex automorphism 
of the set of all states. We shall describe all such automorphisms of the 
set of states of a standard logic. 

Let T be a symmetry of the Hilbert space #, and W the convex set of 
all bounded, nonnegative, self-adjoint operators U of trace class with 
tr(U)=1. For any Ue W let a,(U) be defined by 


(87) ar(U) = TUT-, 


Lemma 4.30. Let U ce W and let T be a symmetry of X. Then ar(U) E W 
and U —> ar(U) is a convex automorphism of W. Moreover, T —> ar is a 
homomorphism of the group of symmetries of # into the group of convex 
automorphisms of W. 


The proof is a routine verification. We leave it to the reader. 
If T is a symmetry of # and if we set, for Ue VW, 


(88) ar(py) = Pear (uy 


where py is the state M —> tr( PU) of Z, then the map py —> a«7(py) is a 
convex automorphism of Z. We shall prove that any convex auto- 
morphism of Y is of this form. Let W , denote the set of all von Neumann 
operators on #. Clearly, WoW ,. If T,,T,€W ,, we write Ti <T to 
mean 7',—7,>0. 


Lemma 4.31. Let œ be a convex automorphism of W. Then there exists a 
unique one-one map «~ of W , onto itself such that 


(ii) a@~(C,A, +c242) = ca~ (41) +C20™ (42) 
(A,, A42 EW 4, Cis C2 real numbers >20); 
a~ has then the property: 


(iii) tr(e~(A)) = tr(4) (Aew), 


(89) v) T, <T, ifandonlyif a~(Tı) < «`(T. 
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Proof. Define «a~(0)=0. If TEeW, and 70, t=tr(T)>0, and 
tT EW, we then put 


(90) a~(T) = ta(t-*7). 


It is trivial to check that «~ has the properties described in the lemma. 
The uniqueness of «~ follows from the fact that the property (ii) of a~ 
implies (90). Equation (90) implies that tr(a~(7’))=tr(7'), proving (iii). 
Finally, let T, < T}. Then 


a~(T) = a~ (Ty) +a~(T2—T) = «~(T)) 


as a~(7,.—T7,)€W , and is hence >0. Conversely, let «~(7';)<a~(T.). 
Since «~ maps W , onto itself, «~(7.)=a~(T,)+a~(T) for some TEW ,. 
Evidently, T,=7,+7, and hence T2 T. 


Lemma 4.32. Let pe X (j=1, 2) be two unit vectors and let P, be the 
orthogonal projection on the one-dimensional subspace D-¢,. Then the two 
following statements are equivalent. 


(91) (i) Pı L P2 
(ii) For any AEX ,, the relations P, <A and P< A imply that 


tr A > 2. 


Proof. Suppose that 9, | p2 and suppose that AEW , satisfies the 
relations P, <4 and P,<A. Let p3, 9,,-+- be vectors in # such that 
{p1, P2 ++} is an orthonormal basis for #. Then, as A20, 


tr(A) = 2, (Agni Pn? 


(Appi +<AGa,Ho> 


= 
= [Pipp +PP). 
Consequently, 
tr(A) > 2. 


This shows that (i) implies (ii). 

Conversely, let (ii) be satisfied. Let us assume that ¢, is not orthogonal 
to pa. We shall construct an AEW , such that P, <A and P,<A but 
tr(A) <2. Two cases arise. 

Case (a): pı and pz are linearly dependent. In this case P, = P, (= P say), 
and if A =P, then tr(A)=1. 

Case (b): pı and pz span a two-dimensional subspace S of #. It is 
obviously enough to consider the case when S=#. Let pz be a unit 
vector of S orthogonal to pı. Then for some a,bED, p, =ap, +bọ7. 
Neither a nor b is zero and |a|?+|b|?=1. Write y= |ala- tp; and 


Ya = |alam*|b[-*d- pz. 
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Then {pı¥2} is an orthonormal basis for S and = |a|p, + |b|. Clearly 
P, is the projection on the subspace D-. With respect to the orthonormal 
basis {¢,.}, Pı and P, have the matrices given by 


1 0 jal? a [| 
0 0 ja] |b]? 
Since |a|?+|b|?=1, the eigenvalues of the matrix of P,—P, are + |6|, 


and hence there exists a unitary operator V such that the operator 
V(P,—P,)V~-* has the matrix given by 


[| 0 
V(P,—P,)V-iw . 
(P,—P,) Y o 


Let M be the unique operator such that VMV-Ż+ has the matrix given by 


VMV- ~ (" o) 
0 0 


Obviously M is self-adjiont, >0 and tr(M)=]|b| <1. Further, 
VMV-i3—V(P,—P,)Vi2=0 


and hence M>P,—P,. Let A=P,+M. Then P,<A, Pa<A but 
tr(A)=1+ |b| <2. This proves the lemma. 


Theorem 4.33. Let 7 be a separable infinite dimensional Hilbert space 
over D and T a symmetry of #. For any UEW let py be the state 
M — tr(P™”U) of L and Ur=TUT-!?. If 


(92) ar(Py) = Pups 


then ar(Pu —> Pu,) îs a convex automorphism of S. Conversely, let a be a 
convex automorphism of S. Then there exists a symmetry T of H such that 


(93) a= ar. 
T is determined up to multiplication by a number c e D with |c| =1. 


Proof. The first part of the theorem is essentially the content of lemma 
4.30. To prove the converse, let a be a convex automorphism of S. By 
theorem 4.23, a induces a convex automorphism of W, denoted once 
again by a. Let a~ be the map of W, determined by a (cf. lemma 4.31). 
Since the extreme points of W are the orthogonal projections on the one- 
dimensional subspaces of # and since « is convex, «œ induces a one-one 
correspondence of the set 2, of all rays of #, onto itself. Let a(r —> r“) 
be this correspondence. Since a~ preserves the trace and the partial 
ordering < of W, (by (iv) of lemma 4.31), and since the orthogonality 
of two rays has been completely characterized in terms of the trace 
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function and the partial ordering < in lemma 4.32, it follows at once that 
r—>r* is orthogonality-preserving. By theorem 4.29 there exists a sym- 
metry T such that r* = T[r] for all rays r. If ær is the convex automorphism 
of F associated with T, this means that «r and « coincide on the set of 
extreme points of W. From this it follows that «=a,;. The fact that T is 
determined up to multiplication by an element ce D with |c|=1 now 
follows from theorem 4.29. The theorem is completely proved. 


4. LOGICS ASSOCIATED WITH 
VON NEUMANN ALGEBRAS 


If the logic # of a quantum mechanical system is not isomorphic to a 
standard one, then the question of describing its observables, states, and 
symmetries becomes a more difficult one. There are no general results in 
this connection. It is our aim to give a few examples which illustrate the 
wide possibilities as well as the difficulties involved in the construction 
of any general theory. 

A large class of logics which are sublogics of standard logics may be 
constructed using the theory of von Neumann algebras. We confine 
ourselves to the case of complex scalars. Let D=C be the complex number 
field and # a separable infinite dimensional Hilbert space over €. We 
write 2(%) for the algebra of all operators on #. For u, ve #, let 


Nyy : A > <A4,v> (A €U(H)). 


Then the A,, are linear functionals on (#4). The weak topology on 
2 (#) is the smallest one with respect to which all the A, , are continuous. 
A von Neumann algebra A is a subalgebra (containing J) of A( 2) such that 
(i) if A e A, then A* e A, and (ii) W is a closed subset of (#4) in the weak 
topology. If X’ is the set of all elements of 2(4) commuting with all the 
elements of 2, W is also a von Neumann algebra and (WY = (cf. Dixmier 
[1] for the general theory of von Neumann algebras). 
Let NX be a von Neumann algebra. We define 


(94) Ly ={M: M E Z(H), P” e W. 


It is then easy to verify that Z, is a logic. We shall call it the logic associ- 
ated with A. A closed linear manifold M lies in #, if and only if every 
element of 2’ leaves M invariant. Thus the logic Zy may also be intro- 
duced as the logic of closed invariant linear manifolds of some von 
Neumann algebra. 

The one-one correspondence x —> A, between observables of #y and 
self-adjoint operators on #, which was discussed in section 2 of this 
chapter, clearly persists even in this case. However, not every self- 
adjoint operator is the operator of an observable associated with Zy 
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Let A be a self-adjoint operator, not necessarily bounded. Then A =4, 
for an observable x of Z, if and only if the spectral projections of A lie 
in X. If A is bounded, the condition will then be that A e A. If A is un- 
bounded, the condition will have to be stated more carefully. In this case, 
A=A, for an @,-observable if and only if UAU~1=A for all unitary 
operators Ue’ (cf. Dixmier [1]; such A’s are said to be affiliated 
to A). € =A N A’ is the center of the algebra A. If we write Le =€ A Lu 
then, Ze is the center of the logic Zy. Ly is irreducible if and only if 
L¢={0,J}, or, equivalently, if @ consists only of multiples of the identity, 
i.e., in the standard terminology of von Neumann algebras, if and only if 
A is a factor. 

It was Murray and von Neumann [1] who pioneered the entire study of 
factors. They classified factors into three types, I, II, and III. A factor A 
is of type I if and only if A is isomorphic to the algebra A(X) of some 
Hilbert space X. From our point of view, 2 is a factor of type I if and only 
if Ly is isomorphic to the standard logic L (X) for some %. The results of 
sections 4.2 and 4.3 therefore give a complete description of the sets of 
observables and states when Y = &,, for some factor A of type I. When 
N is a factor but not of type I, no similar theory exists. As an example, 
we may mention that the problem of determining all the states of Zy, 
when %X is a factor type IT or III. 

We have remarked previously that if Y is not a factor, the center Ly 
of Za is nontrivial. We have also seen in Chapter III that there is an 
intimate connection between Sge and the pure states of Y,. It is clear 
from theorem 3.19 that if any contact is to be made with conventional 
quantum theory, “¢ cannot be continuous. At the other extreme lies the 
situation in which Ze is discrete, i.e., when there exist mutually orthog- 
onal elements M,, M,,--- in Ze such that (i) Vn, M, =, and 
(ii) Me Ye if and only if for some n,<n,.<---,M=\/, Mn, The 
M,’s are then the minimal elements of Hy. A case when Zg is discrete is 
obtained when we choose M, as above, and define A to be the set of all 
operators which leave each M, invariant. Then, for a closed linear mani- 
fold M of X, M € Z, if and only if M=\/, (Ma M,). If we assume that 
dim M,,>3 for all n, we can describe all the pure states of #,. In fact, 
from theorem 3.19 and Gleason’s theorem we conclude immediately that 
p is pure if and only if there exists an n and a unit vector p in M, such that 


(95) p(M) = |P*p]? (MeZ). 


In other words, the pure states of Z, are still in one-one correspondence 
with the rays of #; however, not every ray can be used. Only the rays 
which belong to some M,, correspond to pure states. Physicists describe this 
situation by saying that no ray of #, which is a nontrivial superposition 
of rays in distinct M,, can describe a (pure) state of Zy. Each M, is 
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called a coherent subspace of (pure) states. The conditions which demand 
that the rays describing the pure states should lie in M, for some n 
are usually referred to as superselection rules. (cf. Wick, Wightman, and 


Wigner [1]). 


5. ISOMORPHISM AND IMBEDDING THEOREMS 


It is very important in the foundations of quantum mechanics to ex- 
amine the extent of generality of the concept of an abstract logic. In 
particular, the question arises whether every logic can be regarded as a 
sublogic of a standard logic. Definitive answers to such questions are 
not known. We shall prove in this section some theorems which analyze 
the structure of a somewhat restricted class of logics. The main theorem 
of this section was proved by Piron [1]. A related but somewhat weaker 
result was obtained by Zierler [1]. 

The main tool of our analysis is the coordinatization theorem of Chapter 
II. In order to be able to apply that theorem we must impose a sharp 
restriction on the class of logics to be analyzed. We introduce accordingly 
the following definition. A logic # is said to be projective if the following 
conditions are satisfied: 


(i) given a40 in Y, there is a point x <a; 
(ii) if a#0 in # is the (lattice) sum of a finite set of points, then 
£{(0,a] is a geometry of finite rank; we shall say that a is a 
(96) finite element of # and write dim(a) for the dimension of L {[0,a]; 
(iii) if z,ae LY, a0, #1 and zis a point, then there are points 
y,z € £ such that y <a, z<at andr<yvz; 
(iv) there exists at least one a e Z such that 4<dim(a)< œ. 


For any a e€ &, let 
(97) Pa) = {x : x a point, x < a}. 


Note that if Z is projective and its lattice is complete, then every element 
a of ¥ is the lattice sum of the points it contains. In fact, if b is the sum 
of these points and ba, c=b+ Aa#0 and there would exist a point x<c 
by (i) of (96) which contradicts the definition of b.: 


Lemma 4.34. Let Z be a projective logic whose underlying lattice is 
complete. If a and b, (j e J) are elements of L and ifa |b, for allj, a} \/, b; 
We have the usual identities: 


(Yb) = Ade 
(Av) = Ve 
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If a,be L, then a<b if and only if B(a)SB(b); alb if and only if 
B(a) LP(b). 


Proof. The first two assertions are proved exactly like the corresponding 
assertions in lemma 3.1. We therefore come to the proof of the last 
assertion. If a<b, (a) EP(b) obviously. On the other hand, let a <b. 
Then aAb¥a and hence, by (96) (i), there exists a point x| a^b such 
that x e P(a). Clearly x ¢ V(b). If a |b, B(a) | B(b) obviously. Suppose 
now that (a) LP(b). Then for any x e B(a), x1 Vego y=. Therefore 
bi V zega) =A. 

Let D be a division ring and V a vector space, not necessarily finite 
dimensional, over D with dimension at least 4. Let 0 be an involutive 
anti-automorphism of D and let <. ,.ẹ be a 6-bilinear symmetric form on 
Vx V. Let us assume that ¢., .> is definite; i.e., (z,2)>=0 if and only if 
x=0. For any set M of vectors we write 


(98) M+ ={u:<xu>=0 forall xe M}. 


Obviously, M+ is a linear manifold in V, M A M+=0, and Mc M+. If 
M is a linear manifold and M = M+, we shall say that M is <. , .)-closed. 
Note that no topology is involved in this definition. The pair (V,<., .>) 
is said to be Hilbertian if for any closed linear manifold M, one has: 


(99) V = M +M+ 
algebraically. For any set M S V we write 
(100) M- = MH 


If A, BSV we write AJ] B whenever <a,b>=0 for allac A and be B. 
Clearly 0- =0, V- = V, and 0*=V, V+=0. If M and N are linear mani- 
folds and MCN, then N'c M+. If {M} is a collection of linear manifolds, 
M their algebraic sum, and N | M, for all j, then N | M. All this is trivial. 


Lemma 4.35. The following statements on a set MC V are equivalent. 
(i) Misa<¢.,.>-closed linear manifold. 

(ii) M = M-. 

(iii) M = N+ for some st NS V. 


Moreover, for any set MEV, 


(101) MH = M+, 
(102) (M-)* = Mt, 
and 


(103) (M-)- = M-. 
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Finally, if {M} is a collection of <. , .>-closed linear manifolds, ()\, M, is 
also a <., .y-closed linear manifold. 


Proof. Clearly for any set MCV, Mc M++. Thus M+c(M+)!+. On the 
other hand, as Mc M+, (M++)+c M+. Therefore M+= M+, which is 
(101). Since M- = M++, (102) follows from (101). Further, 


(M-)~- = (MH = (Muy = MH 


by (101), so that (M-)-=M-. This proves (103). We now come to the 
proof of the equivalence of (i) through (iii). Suppose M is a <., .>-closed 
linear manifold. Then M =M++, so that M=M-~-. Thus (i) = (ii). If 
M=M~-, then M =M =(M+)}+, proving that (ii) = (iii). Suppose now 
that M =N+. Then, by (98), Mt+=N1!+=N+=M, proving (iii) = (i). 
Finally, let M=(); M;. As MSM, M+cM,+=M,. Thus 


Mc ANM, = M. 
j 


This proves that M is a <., .>-closed linear manifold. 


Corollary 4.36. If M&V, M- ts the smallest <. , .)-closed linear manifold 
containing M. In particular, if MCN, then M~CN-. 


Proof. M- is <. , .>-closed by (103). It is therefore enough to prove that 
if N is a <.,.>-closed linear manifold and MEN, then M-CN. But 
M-=MttcNtt=N. Finally if MCN, MCN, and as N^ is <.,.>- 
closed, M- ENS. 


Lemma 4.37. If M is a finite dimensional subspace of V, then M is <., .>- 
closed. 


Proof. It is enough to prove that M++& M. Let xe M++, Let N be the 
subspace spanned by M and x NC M++. Since <¢.,.)- is definite, its 
restriction to N x N is definite and hence nonsingular. But N is finite 
dimensional and hence it follows from (13) of Chapter II and the definite- 
ness of <.,.> that N is the direct sum of M and (M+ N^ N). Therefore, 
x=x' +x" where x'e M and x” e M+ A N. Since NGM, x” e M+ A M+ 
and hence xz” =0. Thus ze M. 


Lemma 4.38. If {M} is a family of linear manifolds of V, and if >, M, 
denotes the (algebraic) linear span of the M,, 


(104) (2 M) = (Q M+). 


Proof. Let M=)>, M,. Since M,& M, M+cM,' so that MSN; M,. 
This shows that M- =M1+3((), M,+)+=N, say. On the other hand, as 
Q; M EM, M;oM, SN for all j so that MCN. Since N is ¢., .>- 
closed, it follows from corollary 4.36 that M- €N. Therefore N=M-. 
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Lemma 4.39. If (V,<.,.>) is Hilbertian and M,, Ma,- -, M, is a finite 
set of mutually orthogonal <., .>-closed linear manifolds, then >$_, M, is 
also a <.,.>-closed linear manifold. 


Proof. It is enough to consider the case s=2 since the general case 
follows at once by induction. We want to prove that 


(M,+M,)~ = M, +M. 

Write M =M, +M.. Let xe V. Since (V, <., .>) is Hilbertian, we have 
the unique decompositions 

x =x +y (2,6 M, yeM,+*), 

Y = ta +Z (z2 € M3, z E M,"). 
Now, as M€ M,1, £z E€ M,* so that z e M,+ also. Therefore, 

ze M+ A M, 

and we have: 


x = ti +t +? (x, e Mi, x2 E M3, 2E M,* A Mat). 


Suppose now that x e (M,+M,)~. Then the above equation shows that 
ze (M+M) 0 M,+ A M,", which implies that z=0 on using (104). 
Therefore (M, +M) =M, +M.. 


Theorem 4.40. Let D be a division ring, V a vector space over D with 
4<dim V <œ, 0 an involutive anti-automorphism of D, and <.,.> a 
definite, symmetric, b-bilinear form on V x V. Let £(V,<.,.Y) be the set of 
<. , .>-closed linear manifolds of V, partially ordered under inclusion. If 
(V,<.,.>) is Hilbertian, then L(V, <.,.>) is a complete projective logic, 
and for any collection {M,} of <. , .>-closed linear manifolds of V, the lattice 
operations in L(V, <.,.>) are given by 


VM i7 (> M i E 

j j 
Conversely, let L be any complete projective logic. Then there exists a division 
ring D, an involutive anti-automorphism 6 of D, a vector space V over D, 


and a definite symmetric 6-bilinear form <., .y on V x V such that (V,<., .>) 
ts Hilbertian and L is isomorphic to L(V, <., .>). 


(105) 


Proof. Let D, 0, V, <.,.X be as given in the first half of the theorem. 
Let 2=L(V,<¢.,.>) be the partially ordered set of ¢. , .)-closed linear 
manifolds of V. Assume that (V,<¢.,.>) is Hilbertian. For any linear 
manifold M, M- is the smallest element of Y containing it. It is clear from 
lemmas 4.35 through 4.38 that Y is a complete lattice with the lattice 
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operations given by (105). Next we observe that if we define, for any 
M e L, M+ by (97), then Z becomes a logic. To check this, the only 
thing not immediately obvious is the weak modularity (2) (ii) of Chapter 
TIT. Suppose then M, MEF and M,cM,. Let N=M,'OM,. By 
lemma 4.35, N is a <. ,.)-closed linear manifold so that Ne. Clearly 
NSM, N1 M,. Since (V,¢.,.>) is Hilbertian, V=M,+M," and for 
any x e V we have the decomposition x =x; +22, where xz, e M,, 7, E M,1. 
If now xe M, then x, and x€ M, also, so that xe N. Therefore 
M,=M,+WN and a fortiori Ma =M, vV N. Z is thus a complete logic. We 
claim that it is projective. By lemma 4.37, Z contains all finite dimensional 
subspaces. As dim V >4, (i), (ii), and (iv) of (96) follow at once from this 
fact; (iii) follows from the algebraic decomposition V = M + M+ for any 
M e L. £ is thus a complete projective logic. 

We now come to the converse. Let Z be a complete projective logic. 
Let us now define L” by 


(106) L' ={a:aeL, a finite}. 


By (ii) of (96), a e Z” if and only if the underlying lattice of ([0,a] is a 
geometry. Under the induced partial ordering, Z” is thus a generalized 
geometry. Therefore there exists by theorem 2.16, a division ring D, a 
vector space V of dimension at least 4 over D, and an isomorphism of £” 
onto the generalized geometry of all finite dimensional subspaces of V. Let 
y denote any such isomorphism. 

Fix a nonzero vector vo € V. Let W be any finite dimensional subspace 
of V containing v and of dimension at least 3. The isomorphism y trans- 
fers the orthocomplementation in [0,a] into an orthocomplementation 
on the projective geometry of subspaces of W. Hence, by theorem 2.7 
there exists an involutive anti-automorphism Oy of D and a symmetric, 
definite 6,-bilinear form <.,.>w on Wx W, inducing the orthocom- 
plementation in question, with <vo vow =1. We now argue as in lemma 
4.2 to conclude that 0w =0 is independent of W and that there exists a 
symmetric, definite 0-bilinear form <. , .> on V x V such that (i) (vp,v9) = 1, 
and (ii) if x and y are points of Z, x] y if and only if the rays y(x) and 
y(y) are orthogonal with respect to <., .>. 

Let LY~=L(V,<.,.>) be the lattice of all <., .>-closed linear mani- 
folds of V. We shall now extend y to an isomorphism y~ of # onto L~. 
For any ae &, let 


(107) y~(a)={v:veEV and vey(z) for some point x < a}. 


We shall first show that y~(a) is a linear manifold in V. This is obvious 
for a=0 or a=1, as then y~(0)=0 and y~(1)=V. Let then a40, £1. If 
vi, Vo E y~ (a), there are points x,, xa <a such that v, e y(z,), j=1, 2. Now, 
as y is an isomorphism of Y onto the generalized geometry of finite 
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dimensional subspaces of V, y(x,) and y(x2) are rays in y(x; V £2). Thus if 
V=C,Vı +CV Where c, €2 E D, v lies in y(x, V x2) and hence v e y(x) for 
some point x< z, V%_<a-y~ (a) is thus a linear manifold. We shall prove 
next that y~(a@) is <. , .>-closed. From lemma 4.34 it follows easily that for 
a point xe Z, x<a if and only if z_| y for all y e P(at). Therefore, 


(108) y~ (a) = (y~ (a+))*, 


which shows that y~ (a) is <. , .ẹ-closed. Moreover, as (108) is valid for all 
a (in particular, for a+), we have: 


(109) y~ (at) = (y~(a))’. 


(109) shows that y~ preserves orthocomplementation. From (107) we 
conclude, on using lemma 4.34, that for a, be Z, y~(a)Sy~(b) if and 
only if a<b. In particular, y~(a)=y~(b) if and only if a=b. Therefore, 
in order to prove that y~ is a lattice isomorphism of # onto F(V,<., .>), 
it remains only to prove that y~ is surjective. Let M be any <.,.>- 
closed linear manifold of V and let the subsets A and B of # be defined 
as follows: 


A = fx: xe L, xis a point such that y(x) S M}, 
B = {x : xe Ẹ, x is a point such that y(x) S M+}. 


Since M=M++, xe A if and only if x| B. Let a and b be the lattice 
sums of the points of A and B, respectively; these exist as Y is complete. 
It follows from lemma 4.34 rather easily that a and b are orthogonal 
complements of each other and A=(a), B=Ņ(b). The equation A = f(a) 
implies that y~(a)=M. The proof that y~ is surjective is complete. y~ is 
thus a lattice isomorphism of L onto £(V,<.,.>). 

We observe finally that (V, <. , .ẹ is Hilbertian. In fact, let M, 40, 4 V, 
be any <. , .>-closed linear manifold, and v e V, v#Æ0. Let a e F be such 
that y~(a)=M. Clearly, a#0, 41. Then y“ (a+) = M+. Choose x e £ such 
that v e y(x). By (iii) of (96), there are points y, z of Z such that y <a, 
z<at and x<yvz. Hence y(x) <yly) v y(z)=yly)+y(z). This shows that 
veyly)+y(z)EM+M-+ and demonstrates the Hilbertian character of 
(V,<. , .>). This completes the proof of the theorem. 


The division ring D which enters the second half of theorem 4.40 is 
uniquely determined by Z (up to isomorphism of course), as can be 
easily deduced from theorem 2.1. If D were one of R, C, or H, and Ẹ isa 
complete projective logic, we shall say that Z is associated with D if the 
involutive anti-automorphism 6, constructed in theorem 4.40, is con- 
tinuous. This is a restriction only in the complex case. If # is associated 
with D, 0 must coincide with the conjugation *, which is complex con- 
jugation when D=C, the identity when D=R, and the standard con- 
jugation when D=H. In view of theorem 4.40 and these comments we can 


120 GEOMETRY OF QUANTUM THEORY 


assert that, given any complete projective logic Z associated with D 
which is one of R, C, or H, there exists a pre-Hilbert space V, with inner 
product <.,.>, such that (i) (V,<.,.>) is Hilbertian and (ii) & is iso- 
morphic to the complete projective logic of all <. , .)-closed linear mani- 
folds of V. It is natural to expect that V is then actually a Hilbert space. 
This essentially leads to Piron’s theorem. We shall obtain it as a con- 
sequence of a few lemmas. 

It is an interesting problem to examine what natural assumptions on a 
geometry imply that the associated division ring is isomorphic to one of 
R, C, H. This is a classical question and is intimately connected with the 
topologies on a geometry (cf. Kolmogorov [1], Weiss and Zierler [1]). 


Lemma 4.41. Let V be a pre-Hilbert space over D (one of R, C, or H) and 
let Æ be tts completion. If £o, xo E Æ are orthogonal, there exists a pair of 
sequences x, and x, such that 


(i) x, L 2,’ (n,m = 0, 1, 2,---), 


(110) (ii) £as in EV and 2x, —> 2%, Em > 2X’. 


Proof. We shall prove first that if ze # is a nonzero vector, then 
{z}- A V is dense in {z}-. In fact, let y | z, and let {y,}, {z,} be sequences of 
elements of V such that Yy, —> Y, z, > z. Write 


(111) Yn = Yn—< Ym >2ns2> Zp. 
Then y, € V and Cy,’,z2>=0 for all n. As y, > y and z, —> z, 
€ Yast ><Sns%> ~ KA —> [LY zyz, z» lz = 0, 


showing that y,’—>y. This proves our assertion. Suppose next that 
Y,Y1,°°*, Ys are mutually orthogonal nonzero vectors. Applying repeatedly 
the result proved just now, we conclude in succession that {y,}+ A V is 
dense in {y,}+, {y¥,Ys-1}+ A V is dense in {y,,y,_,}+ and so on. In other 
words, given any finite dimensional subspace Y of # and a vector y | Y, 
there exists a sequence {z,} in V such that z, |] Y for all n and z, > y. 

This said, we come to the proof of the lemma. Let 2, 7)’ E€ # be given 
with zol tọ. We shall show by induction that there are sequences 


{x,} and {x,’} in V such that 
112) [azal S 27%, aneo <2 
Ly L Lm’ (n,m = 0, 1,2,---). 


Suppose 2, %,---, 2, and x,’, X2',---, x,’ have been constructed so that 
(112) is satisfied for all n, m=0, 1, 2,---,k. By the result proved in the 
previous paragraph, there is a vector £k, E V such that 


+1 L Ln (n = 0, 1,---, k), 


lini —%o | < 2Q-(k+)). 
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Applying the same argument again, we can find z,,, E V such that 


ear L Xp" (n = 0, 1,---,&+1), 


|t,41—Xo| < Q-(k+1). 


Thus {X,}o<cnsx+1 and {%y'socn<n+1 Satisfy (112) for n and m<k+1. This 
proves by induction the existence of a pair of sequences satisfying (112). 
This immediately proves lemma 7.41. 


Lemma 4.42,¢ Let V be a pre-Hilbert space over D such that (V,<., .>) 
ws Hilbertian. Then V is complete. 


Proof. Let # be the completion of V and 2, an arbitrary unit vector of 
H. We denote, for any subset L of #, its orthogonal complement in #7 
by L'. We must prove that x) € V. Since V is dense in #, there is a y e V 
such that (y,7)> 40, and by changing y to a suitable multiple of it we may 
assume that ¢y,%)>=1. Let 29=y—a. Then y=2x +2 and zol Zo By 
lemma 4.41 there are sequences {x,} and {z,} in V such that 


(113) En > Zo, Zn —> Zo, Ln L Zm (n,m = 0,1, 2,---). 


Let Mc V be the lattice sum in Z (V, <., .>) ofthe rays D-2, (n=1,2,---). 
Then, as (V, <., .>) is Hilbertian, 


V = M + M+. 


Now z,, 1%, for all n so that z„ e M+ for all m. Hence x, c M° and 
Zo E (M+) where cl denotes closure in #. Moreover, it is obvious that 
(M+)1c (M°). Therefore, 


(114) Y = toto, XEM", zo e(MAY. 
On the other hand, as V =M + M+, we can also write 
(115) Y = Lo +HZo, % EM, z e M+. 


As M+c(M)’, a comparison of (114) and (115) shows that 2,’=2, and 
Zo =2Zq. In particular, x) € V. This completes the proof of the lemma. 


Lemma 4.43. Let Z be a projective logic with the property that any family 
of mutually orthogonal points of L is at most denumerable. Then L is 
complete. 


Proof. It is enough to prove that arbitrary lattice unions exist in Z. 
Let {a,: j e F} be a family of elements of #. For any countable subset 
DEF, let 

ce(D) = V a. 


jeD 


t The argument for completeness is essentially that of I. Amemiya and H. Araki, 
Publ. Res. Inst. Math. Sci. Kyoto Univ. A2 (1966), p. 423. Their theorem is that if 
V is pre-Hilbertian and ¥(V, <.,.>) is orthomodular, then V is complete. 
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Let B be a family of mutually orthogonal points x such that (i) for each 
xe B there is a countable set DOF such that x<c(D), and (ii) B is 
maximal with respect to (i). By hypothesis, B is at most countable. Let 


B= {x,, Za’ }, 
and for each n let D, S F be a countable set such that 
x, < c(D,) 
for all n. Write D=\_J, D,. If we define x by 
n= V Lny 


then x <c(D). If x#c(D), there will exist, by virtue of (i) of (96), a point 
y <c(D) such that y | x. Since D is countable, this contradicts the maxi- 
mality of B. Therefore x=c(D). We claim that a, <x for all j e F. Ifj e D, 
this is trivial. If j ¢ D and a, xx, a; vx, and hence by (i) of (96) there 
will exist a point ¢ such that t| x and t<a,v x. Since t<c(D U {j}) and 
t | B, we have a contradiction. Consequently, a, must be <x. This proves 
that x= V jer ay. 


Theorem 4.44, (Piron [1]). Let L be any logic. Then, a necessary and 
sufficient condition that L be isomorphic to the logic of all closed linear 
manifolds of a separable Hilbert space over D (which is one of R, C, or H) 
is that L be a projective logic associated with D and have the property that 
every family of mutually orthogonal points of L be at most countable. 


Proof. The proof follows at once from theorem 4.40 and lemmas 4.41 
through 4.43. 


NOTES ON CHAPTER IV 


1. Hidden variables: Is the quantum mechanical description of microphysics 
complete? The question of “hidden variables” has persisted in the foun- 
dations of Quantum Theory from its very birth. The basic assumption in 
Quantum Mechanics, namely that in any state of an atomic system a 
physical quantity can in general have no sharply defined value (even under 
idealized measuring conditions) but only a probability distribution of the 
possible values, appeared to many people to be in very sharp conflict with 
all classical experience, and hence unacceptable, unless it was interpreted as 
follows: one is not able to get a complete description of the physical state 
of the system; there are additional coordinates (“hidden variables”) that 
cannot be measured; and therefore the statistical character of the results of 
the experiments is due to the averaging over these hidden variables. This 
view, which regards the Hilbert space description of quantum states as 
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fundamentally incomplete, is reinforced by the example of statistical 
mechanics where the situation is essentially of this character. 

Von Neumann was among the earliest to analyze this question, and his 
conclusion was that no mechanism of hidden variables could account for the 
statistical content of Quantum Mechanics as currently formulated. That he 
regarded this question, as well as his resolution of it, to be of great importance 
is clear from the fact that he raises the issue already in the preface of his 
book (pp. ix, x), returns to it briefly (pp. 209-210) before devoting a 
substantial effort (pp. 295-328) to a detailed mathematical and physical 
treatment (all references are to the 1955 English translation by Robert T. 
Beyer of von Neumann’s book). We shall briefly summarize his argument 
but suggest strongly that the reader study von Neumann’s brilliantly 
presented analysis. Von Neumann’s reasoning, roughly speaking, has two 
parts to it, one physical, the other mathematical; and they reinforce each 
other. 

The physical side of his argumentation uses the technique of ensembles 
and begins by considering a statistical ensemble consisting of a large 
number of models of the physical system under consideration. If we reject 
the view of Quantum Mechanics, and suppose that the values of physical 
quantities in the ensemble have positive dispersions only because the 
ensemble has not been resolved in a sufficiently fine manner, we must admit 
that we should be able to carry out such a resolution into subensembles in 
which the dispersions are zero, or at least diminished from those of the 
original ensemble. The method of doing this is to measure the physical 
quantities in succession, replacing the ensemble each time to one of the 
subensembles where the quantity has a sharply defined value. But then, 
the Heisenberg uncertainty relations between complementary. physical 
quantities operate in such a way that the precision achieved for a quantity 
in any stage is destroyed at the next stage when a complementary quantity 
is measured; we thus do not get ahead. However, one may take exception to 
this line of reasoning by pointing out that there could conceivably be other 
methods of penetrating to the dispersion free ensembles. To settle this 
point decisively one must therefore answer the following question: Given 
an ensemble in which certain physical quantities have positive dispersion, 
is it possible to resolve this ensemble into a superposition of two sub- 
ensembles, different from the original one and distinct from each other? 
Ensembles which cannot be so resolved are the pure (homogeneous in von 
Neumann’s terminology) ones, and the above question becomes the 
following: Is every pure ensemble dispersion free? If the answer is no, then 
the hidden variables interpretation must be given up. 

To answer this question in the conventional model of Quantum Mechanics 
von Neumann begins with the very interesting observation that since an 
ensemble is characterized (statistically) by the expectation values of all the 
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physical quantities in it, one may replace the ensemble in the mathematical 
argument by the corresponding expectation functional; we must remember 
that if A is any bounded observable, its probability distribution is com- 
pletely determined by the knowledge of all the expected values 


Exp(A*), n = 0,1,2,.... 
Thus, for von Neumann, an ensemble is the same as the corresponding real 
valued (expectation) functional 

Exp: B,(#)>R 

on the space B,(#) of all bounded self-adjoint operators on the complex 
Hilbert space #; then von Neumann assumes that it satisfies the con- 
ditions: 
(a) Exp(1) = 1; 
(b) Exp(4?) 20, VAeB(#); 
(c) Exp is linear (over R); 
(d) Exp is continuous in the strong operator topology. 


(1) 


Nowadays, one works with the space B(#) of all bounded operators 
on #; the conditions (a)-(c) define a state, (b) being equivalent to 
Exp(AA') > 0, VAeB(#); with (d), we have a normal state. He showed 
that the functionals Exp are in one~one correspondence with bounded 
operators U which are self-adjoint, 20, and of trace 1, the correspond- 
ence being defined by 


(2) Exp(A) = tr(U A). 


The correspondence preserves convexity and so matches the pure states 
with the extreme points of the convex set of the U’s. He verified that the 
latter are the one-dimensional projections, and showed by explicit calcu- 
lation that these are never dispersion free. This concluded his proof. 

One must emphasize that his reasoning explicitly assumes that Quantum 
Mechanics is described by the conventional model. It is possible to modify von 
Neumann’s argument so as to allow for the presence of superselection rules, 
but the essence of the reasoning does not change. Many subsequent criticisms 
of von Neumann’s proof are unjust because they do not take into account 
the above assumption in his proof, although von Neumann states it quite 
unmistakably (see the first sentence of the first paragraph on p. 210, and 
lines 17+ on p. 324; see also the remarks of L. van Hove, Von Neumann’s 
contributions to Quantum Theory, pp. 95-99, in John von Neumann, 
1903-1957, Bull. Amer. Math. Soc. 64 (1958), No. 3, Part 2). The mathe- 
matical content of von Neumann’s argument may thus be summarized as 
follows: B(#) does not admit any dispersion free normal state. 
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The first question that arises naturally now is whether von Neumann’s 
definition of a state was unduly restrictive. Obviously the conditions (c), 
-(d), of (1) need to be examined more closely. A conservative definition 
would replace (c) by 


(3) (c) Exp(A+ JB) = Exp(A)+Exp(B) when AB=BA. 


Let us call maps A: B,(#) > R satisfying (a), (b) of (1) and (c’) of (3) physical 
states. It then follows that |\(A)| <||A|| and that the restriction of À to the 
projections defines a finitely additive probability measure on the logic 
L(A). Conversely, it follows from the spectral theorem that any finitely 
additive probability measure on £(#) is the restriction to the projections 
of a unique physical state of B,(#). In other words, physical states may be 
identified with finitely additive probability measures on L(I). Thus the 
strongest generalization (in this direction) of the von Neumann result 
would be to show that there are no finitely additive dispersion free prob- 
ability measures on £(#). The case when 3 <dim(#) <œ is immediate; 
for, by Gleason’s theorem, every finitely additive probability measure on 
L(A) is the restriction to the projections of a functional A—tr (UA) 
(U >0, tr(U)=1), and hence von Neumann’s result already applies here. 
If dim(#’)=0co, we use the isomorphism # I H @ C3 to imbed #(C?) 
inside (4°). Since for a finitely additive probability measure, being 
dispersion free is equivalent to its being {0,1}-valued, this property persists 
on restriction from L (H) to L (C), allowing us to use the previous argument 
to conclude that Y (H) has no dispersion free physical states. 

A few footnotes may be added to this brief discussion. In the first place, as 
we shall see later in these notes when we discuss Gleason’s theorem and its 
generalizations, physical states on any von Neumann algebra without J, 
summands are actually states; in particular, this is true for B(.#). Secondly, 
there are actually states that are not normal. In fact, regarding B(#) as a 
Banach space, the states form a compact (in the weak topology) convex 
subset of the dual of B(#); if (€m)m>, is an orthonormal basis of Z, 
Am(A) =(Aem em) (A E B(H)), and v is a limit point of {Àm}m>1 one can show 
that vis a state which is not normal. Our final remark is to point out that it 
was Mackey who first realized that one should really prove von Neumann’s 
result with physical states rather than states, or at least with countably 
additive probability measures on ¥(#); Gleason’s work which showed 
that every countably additive probability measure on #(#H) defines a 
normal state on B(#) at least when 3 <dim(#)<oo, was inspired by 
Mackey’s investigations. In addition to being a significant and aesthetically 
satisfying strengthening of the von Neumann result, this work and others 
that were inspired by it have led to a deeper understanding of the questions 
involved. The fact that physical states are the same as states was proved in 
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full generality only recently, and the main motivation for reaching this 
result had always been the desire to generalize Gleason’s theorem to more 
general von Neumann algebras than B(#). 

A second question that arises now is whether one can relax the assumption 
in the von Neumann-Mackey-—Gleason result that the logic of the physical 
system is Y (Z). There is an extensive literature on this question and I shall 
limit myself to a brief discussion of two results, those of Neal Zierler and 
Michael Schlessinger (Boolean imbeddings of orthomodular sets and Quantum 
Logic, Duke Math. J., 32 (1965), pp. 251-262), and of Simon Kochen and 
E.P.Specker (The problem of hidden variables in Quantum Mechanics, 
J. Math. Mech., 17 (1967), pp. 331-348); for more detail and other references 
the reader should consult the lucid discussion of these problems in the book 
of Beltrametti and Cassinelli (of notes to Chapter ITI). 

In both of these articles, the interpretation of what is meant by hidden 
variables is such that it implies the existence of some sort of imbedding of 
the proposition system into a Boolean algebra. So the mathematical issue 
in both cases comes down to showing that under suitable assumptions on 
the proposition systems such imbeddings do not exist. 

In Z-S it is assumed that the set of experimentally verifiable quantum 
propositions is partially ordered, has 0 and 1, and an orthocomplementation 
1:9 > (0'=1, 1+=0,a<b=b' <a, at+=a, avat=1,aat=0) which 
is weakly modular (a <b > a+ Ab exists and b=av(atAb)); it is not assumed 
that Z is a lattice. As usual if a <b+ we say a and b are orthogonal; sums of 
mutually orthogonal elements exist in Z. If x, yeZ they are said to 
commute if we can write x=2,Vz, y=Yy,Vz where x, y,, and z are mutually 
orthogonal; this is possible if and only if x and y are contained in a Boolean 
subalgebra of Z (recall that B < # is a Boolean subalgebra if 0, 1 e B; if 
c,a,beB=cteB, avb, anb exist in Z and belong to B; and if B becomes a 
Boolean algebra with respect to v, A, L). If B is a Boolean algebra, an 
imbedding of Z into B is a map h (F > B) such that 


(a) A(0)=0, A(1) =1; 
(b) 2<yh(x) < hly); 
(4) (c) A(x) = h(x*); 
(d) A(zvy) = h(x) v hy); 
h(x Ay) =h(x)Ah(y) ifx and y commute. 


Actually it is enough to require (b), (c) and the single set of relations 
h(x +y)=h(x)vh(y) in (d). Z-S prove that if Z contains a copy of Z (R3), 
then # has no imbedding into a Boolean algebra; indeed, it follows from 
Stone’s theorem that any Boolean algebra has {0,1}-valued probability 
measures, so that such imbeddings would give rise to a {0,1}-valued prob- 
ability measure on X (R?). Z-S also proved that for arbitrary Z, if one 
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assumes that there are imbeddings into Boolean algebras which satisfy 
(d) of (4) in the following stronger form 


(5) (d’) A(avy)=A(x)va(y) ifx vy exists in &, 


then xvy will exist in # only when x and y commute; thus if f is in addition 
a lattice (e.g., a logic), such an imbedding will exist only when Y is already 
a Boolean algebra. 

The analysis of K-S is along similar lines but it is formulated in the 
language of partial Boolean algebras. A partial Boolean algebra is a set L 
with a family B ={B} of subsets of L with the following properties: 


(a) each Be B has the structure of a Boolean algebra; if B, B'e B 
and B € B’ (as sets), B is a Boolean subalgebra of B’; 


(b) given B, B’e BS, BOB’ cB; 
(c) each element of L is in some Be B; 


(d) ifa,,...,a@,¢Land any two of them are contained in some member 
of B, there is a member of B that contains all of them. 


It follows easily from these axioms that L has the following properties: 


(a’) the unit and null elements of all members of B are the same; we 
write 0 and 1 for them; 


(b’) ifaeL there is a unique a+ e L such that a+ is the complement of 
(7) ain any Be 8 that contains a; 


(c’) write, for a, be L, a <b to mean that a and b are in some BEB 
and satisfy this relation there; then < is a partial order on L; 
(d’) with respect to (<,1), L becomes orthocomplemented and 
weakly modular. 
Obviously, if we start with an orthocomplemented weakly modular L its 
Boolean subalgebras will give L the structure of a partial Boolean algebra 
if (d) of (6) is satisfied. A morphism of a partial Boolean algebra (Z,8) into 
another (L’,8’) is a map h:L—-L’ such that for any Be, there is a 
B'e’ such that h(B)cB’ and h:B->B' is a morphism of Boolean 
algebras. Imbeddings are injective morphisms. 

For K-S the basic assumption is that the set Y of experimental propo- 
sitions is a partial Boolean algebra, and as we mentioned earlier, their 
interpretation of the existence of hidden variables will imply the existence 
of an imbedding of ¥ into a Boolean algebra. They prove that the existence 
of such an imbedding is equivalent requiring that if a, b e Z and a £b, there 
is a {0,1}-valued probability measure h on # such that h(a) £h(b). They then 
proceed to construct explicitly a finite subset F of (R3) with the property 
that the partial Boolean subalgebra of “(R*) generated by F does not 
admit a single {0,1}-valued probability measure. 
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The case of the logics of the two-dimensional spaces has been completely 
excluded in the above discussion. The spin observables of an electron are an 
example of such a system. Here the Hilbert space # = C*. If 


0 1 0 —2 1 0 
8 ~— — paaa 
(8) 74 h a) ' 72 (, a) ? 73 (o — ) 


are the Pauli spin matrices, then for any @ = (a,,d2,@,) in the unit sphere S? 
of R3, the self-adjoint operator @:¢=a,0, +4,0,+4,0, corresponds to the 
spin component along the a@-direction; its spectral projections are }(1 +a-c), 
corresponding to the eigenvalues + 1. The map a->}(1+a- g) is a bijection 
of S? with the set of (Hermitian) one-dimensional projections of #, and 
gives rise to a bijection p>f, between the set of probability measures p on 
L(H#) and functions f: S?— [0,1] such that 


(9) f@20, fi@)+f(-a)=1 (eS), 


with p(}(1+a-c)) =f (a). The quantum states, i.e., the states in the con- 
ventional model of Quantum Mechanics are the functions g? (6 e 82) defined 
by 

(10) gz (a) =4(1+ a 5) (-=scalar product in R3). 


It is clear that the functions (10) form only a small part of the convex set of 
functions (9) (even if we assume they are Borel). The extreme points of the 
latter set are the functions that take only the values 0 and 1, the dispersion 
free states; they are the characteristic functions 14 where A <S? is any 
subset with AU(—A)=S*, AN(—A)=Q@. This suggests that we can 
express the states (10) as mixtures of suitable families of dispersion free 
states and hence that a classical imbedding may exist for the quantum 
system associated to #. This is in fact so, and the details were first given 
in articles of Kochen-Specker (loc. cit.) and J.S. Bell (On the problem of 
hidden variables in Quantum Mechanics, Rev. Mod. Phys., 38 (1966), 
pp. 447-452). In what follows we shall describe briefly the construction of 
Kochen-Specker; Bell’s example is built along similar lines. 

The space of “hidden variables” is 82. Its points # define dispersion free 
states fz; for each self-adjoint operator A ¢B(#’), there corresponds a 
function F 4 on S?; and for each pure quantum state q, there corresponds a 
probability measure p, on S*. The F4 and p, are related by 


(i) Fua = u(F'4) for any Borel function u, 


(i) 9(A)= f, Patty 


The relation (i) asserts that the functional relationships among quantum 
observables are preserved under the map A —> F 4, and relation (ii) asserts 


(11) 
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that the probability distribution of A in the state q is that of F4 with 
respect to ua. If A =4(AI +7-c) (Ae R) (having eigenvalues 4$(A + 1)), 


(12) Fa = }(A+1) izt +4(A—1) late, 
where 13.35, is the characteristic function of the set of Ùs with w-r>0, and 
so on. It is immediate that the assignment (12) possesses the property (i) 
of (11). If qz is the quantum state (10), the associated probability measure 
pe=p,7 is defined by 
(13) du? = 4(5-%) 13.25 di, 
where dw is the surface measure on S?, given by dw = (1/47) sin 6d0 dq in 
the polar coordinates 

Ù = (sin 0 cos p, sin Osin p, cos p) (0 <0 < 7,0 < p < 27). 


The relation (ii) of (11) then follows from the formula 


(14) 140-7 Í 4(b-i) dè. 
2 3-P>0,b-t>0 


Finally, for each ù let fz(@ > fz(&)) be the function which is 1 when %-a>0, 
0 when w-a<0, and which is defined so satisfy (9) when #-a=0. Then fz 
defines a dispersion free state on Z(H), and the formula (14) becomes 


(15) q= [ fodur) 


exhibiting qz as a mixture of dispersion free states; the absence of explicit 
definition of fz(7) when w-r=0 does not matter, since this set has measure 
0 in w. 

We now turn our attention to the analysis from the hidden variables 
point of view, due to Bell, of the Kinstein—Podolsky-Rosen example 
(J.S. Bell, On the Hinstein—Podolsky-Rosen paradox, Physics, 1 (1964), 
pp. 195-200); actually Bell considers the variant, due to Bohm and 
Aharanov, of the EPR example. The system consists of a pair of spin 4 
particles and only their spin observables are of interest. Thus the Hilbert 
space is (cf. the remarks below in the discussion of composite systems) 


A = C2 @ C?, 
The particles are in the “singlet” state defined by the state vector 
1 
16 Ð = — tO ~—o-~ & +), 
(16) Np (p+ © p-p © p+) 
where g+ are the spin states of eigenvalues + 1, say, in the z-direction. For 


instance, the spin parts of the states of the two electrons in the Helium atom 
or the H,-molecule are in this state (which is the unique rotation invariant 
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antisymmetric state); even if the electrons are stripped away by dipole 
radiation and become widely separated spatially, (16) describes to a good 
approximation the compound state of the two electrons (cf. the discussion 
of E.P.Wigner in Interpretation of Quantum Mechanics, in Quantum 
Theory and Measurement, edited by J.A.Wheeler and W.H. Zurek, 
Princeton University Press, Princeton, NJ, 1983, pp. 260-314, especially 
pp. 291-293). Let X,,3 and X23 be the observables measuring the spin 
components of the first and second particle in the direction a. In Quantum 
Mechanics they are represented respectively by the operators @-¢@ 1 and 
1@a-c. Let us write Pox (a, b) for the expectation value of X,2°X23 
under the state ®: 


Pon (a,b) = (4-3 @b-6) D, 0). 
A simple calculation gives 
(17) Pox (4,6) = —å-b = — cos 0, 


where @ is the angle between @ and b. The question studied in loc. cit. by 
Bell is whether the correlated statistics of the two particles in the singlet 
state that is summarized by (17) can be obtained from a hidden variables 
model that is local in the sense introduced by Einstein, Podolsky, and 
Rosen in their famous paper, namely, measurements on either particle do 
not disturb the other. Bell’s analysis showed that the correlations arising 
from any local hidden variable model must satisfy certain limitations that 
conflict with (17). For nonlocal models, however, these inequalities may be 
violated (cf. M. Flato et al., Helv. Phys. Acta, 48 (1975), pp. 219-225). 

The model consists of a probability space X of points A, and a probability 
measure dà on X; moreover, the assumption of locality enables us to say 
that there are random variables Ax(a: à) and A,(b: A) corresponding to 
X,,z and X2,3, respectively, for a, beS8?, The A (@:A) are +1 valued. The 
correlations are defined by 


Puy (à, b) = [ Auda) Aber dd, 


> 
>; 


where the suffix stands for hidden variables. If å, b, a’, b’ are four points of 
S? then (suppressing 4) 


[Pav (35) — Pe 8] < | [4s Ay) (14 As’) 46) a 


+ lat AO) (1540) A Öna 


< (1 + Pav (a' b)+ F Puy (a,b). 
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So (cf. M. Lamehi-Rachti and W. Mittig, Phys. Rev. D, 14 (1976), pp. 2543- 
2555) we obtain the inequality 
(18) | Pay (b) — Pnv (@6")| + [Puy (0) + Pray (2’,6)| < 2, 


which must be satisfied by the correlations in any local hidden variables 
model. In Quantum Mechanics, Poy (a,a)=—1; Pay satisfies this if and 
only if 


(19) A,(a:A) = —A,(a:d). 
If we assume (19), then (18) becomes 
(20) |Pav Èb) — Pav (a,¢)| < < 1+Pav (6, ô). 


This is Bell’s inequality. It is trivial to check that (17) violates (20). Indeed, 
if 0,5, 453, 93; are the angles between a, b, b, č, and é, a, then for (17) to obey 
(20) we should have 


sin? (013/2) + sin? (833/2) > sin? (031/2), 


which is false if 0,.=98,,=7/3, 05, = 27/3. 

If we assume the model to have natural covariance properties with 
respect to rotations, nen Pay (a; b) depends only on the angle @ between a 
and 6 say Puy (a, b) = P ny (8). Obviously, 


Pay (9) = Pay (—8) = — Pav (0+7). 


The inequalities (18) now imply bounds for Pyy(4). Indeed, taking a’, B 
coplanar with a, b we get, writing P for Pav, 


(21) |P(8) —P(8+a)|+| P(A’) +P(8' +a)| < 2, 
for all 0, 6’, a from which one gets (cf. Lamehi-Rachi and Mittig, loc. cit.), 
(22) |P(7/6)| <3, |P(7/4)| <4, [3P(7/3)+P(0)| < 2 


The article of Bell attracted the attention of physicists and led to experi- 
ments whether (22) or (17) is true (cf. Wigner loc. cit.). The experimental 
results so far support Quantum Mechanics (see some of the experimental 
evidence presented in the articles of the collection edited by Wheeler and 
Zurek, loc. cit.). 

Even though the basic Hilbert space has dimension 4 here, the above 
treatment of the hidden variables question cannot be subsumed under the 
von Neumann-—Mackey-—Gleason paradigm. The point is that for each Àe X, 
the model gives only the values of Exp(X,3"-X23") (m,n > 0); and 
hence each Ae X can define a dispersion free state only on the 1 maximal 
abelian subalgebras (a, b) of # generated by 4'o&1 and 1@b-3. Hidden 
variable theories such as these are called contextual in the book of Beltra- 
metti and Cassinelli (cf. notes on Chapter IIT). The reader who wants to 
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know more about these and get a sharper perspective on the entire question 
of hidden variables cannot do better than consult the book of Beltrametti 
and Cassinelli, for its thorough treatment of related issues as well as the 
references it contains to other literature. 

For a question as deep as the one that asks whether quantum mechanical 
description of the atomic world is complete, a mere mathematical analysis 
will hardly provide a completely satisfying answer. The argument must also 
be based on analyzing possible measurements and the limitations imposed 
on them by the uncertainty relations. The paper of Einstein, Podolsky, and 
Rosen, and Bohr’s refutation of it, offer perhaps the most profound 
discussion of this question from the natural philosopher’s point of view 
(cf. the relevant articles in the collection edited by Wheeler and Zurek, 
loc. cit.). 


2. Algebraic formulation of Quantum Mechanics. In some sense the 
algebraic method of viewing the foundations of Quantum Mechanics is the 
oldest of all approaches, and arguably, the most flexible one. The early 
papers of Born, Jordan, Heisenberg, and Dirac repeatedly emphasized that 
quantum observables are represented by matrices and so form a noncom- 
mutative algebra, in sharp contrast to the commutative algebra of classical 
observables (for a historical account as well as translations of the early 
papers, see Sources of Quantum Mechanics, B. L. van der Waerden, Dover, 
New York, 1967). Eventually this came to be formalized as the principle 
that the quantum observables are in one-one correspondence with the self- 
adjoint operators on a (separable, complex) Hilbert space. In spite of the 
overwhelming success of the models built out of this assumption it was 
always understood that this was a very ad hoc solution to the mathematical, 
physical, and philosophical problems raised by atomic physics. Especially 
troublesome was the fact that both the operations of addition and multipli- 
cation involved possibly noncommuting (hence nonsimultaneously 
observable) observables, thus making it very difficult to give any con- 
vincing “explanation” for them. (Perhaps one should not put too much 
faith in such “explanations”. Most explanations make implicit or explicit 
use of classical perceptions and are not too relevant in the atomic realm.) 
There was also the fact that every self-adjoint operator was supposed to 
represent an observable; in the presence of superposition rules one should 
modify this suitably, but even then this was gradually seen as a far-reaching 
enlargement of what can be effectively observed in atomic systems—see 
Interpretation of Quantum Mechanics, E. P. Wigner, in Quantum Theory and 
Measurement, edited by J. A. Wheeler and W. H. Zurek, p. 298. Multipli- 
cation was felt to present the greater problem since it could be argued that 
if A and B are observables, A + B can be effectively defined by prescribing 
that its expected value in any state is the sum of the expected values of A 
and B in that state. This, of course, is a little deceptive, for, as we have 


LOGICS ASSOCIATED WITH HILBERT SPACES 133 


already remarked above, the linearity of expectation values is a deep 
property of the propositional logic; its validity for the standard logics (in 
dimension > 3) is a consequence of Gleason’s theorem while its truth for the 
more general logics of projections in von Neumann algebras has only 
recently been proved (see the remarks on Gleason’s theorem, and also 
§14.5 of the book of Beltrametti and Cassinelli, cited in the notes to 
Chapter III). 

The first systematic investigations on the algebraic structure of the set of 
observables are due to Jordan, von Neumann, and Wigner; the initial papers 
are those of Jordan in Gott. Nachr. (1932), pp. 569-575, Gött. Nachr. (1933), 
pp. 209-217, and Z. Phys., 80 (1933), pp. 285-291, and that of Jordan, 
von Neumann, and Wigner in Ann. Math., 35 (1934), pp. 29-64. One of the 
motivations for these studies was the feeling that the generalizations of 
Quantum Mechanics to relativistic and nuclear physics might force such a 
more general view of observables. Jordan’s basic observation was that 
although the set of bounded self-adjoint operators do not form an algebra 
under multiplication, it is nevertheless an algebra over R, even abelian, if 
we define a new product by A : B=}4(4B + BA). This product is bilinear and 
satisfies the distributive laws A:(B+C)=A-:B+A:C, although not the 
associative law; moreover, the identity A-B=4[(A + .B)?— A? — B?] shows 
that this product can be obtained from the linear and power structures. 
Nowadays such algebras, emerging as they did out of the pioneering studies 
of Jordan, are known as Jordan algebras; see the book of Jacobson, The 
structure of representations of Jordan algebras, Amer. Math. Soc. Colloq. 
Publications, vol. XX XIX, 1968, Providence, R.I. 

Typically, one starts with an associative but not necessarily commutative 
algebra and define the new multiplication by a-b=4(ab+ ba) to obtain a 
Jordan algebra. But it is not always possible to obtain a given Jordan 
algebra in this fashion. The fundamental result of Jordan, von Neumann, 
and Wigner in their 1934 paper is the complete classification of all finite 
dimensional Jordan algebras A over R which are irreducible and formally 
real, i.e., satisfy 


Qis -An EA, a +... +a = 0 > a, =... =a, = 0. 


It follows from their classification that, with one exception (that of 2,8, 
the algebra of self-adjoint 3 x 3 matrices over the Cayley numbers), all of 
these are obtained in the above-mentioned manner from associative 
algebras; and further, that if some special cases are disregarded, one gets 
just the Jordan algebras of Hermitian matrices over R, C, or H. 

The finite dimensionality assumption is a very severe one and von 
Neumann began the study of infinite dimensional topological algebras 
(Mat. Sb., 1 (1936), pp. 415-484; Collected Works, Vol. III, No. 9, Pergamon, 
Oxford, 1961). His investigation in this paper contains a deep analysis of 
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the spectral theory of individual elements and its relationship to simul- 
taneous observability. Although he had, in connection with his analysis of 
the possible existence of hidden variables (cf. his book), introduced the 
notion of the state of a system as a positive definite linear form on the 
observable algebra, this paper does not contain a study of the states. 

With the appearance, in the early 1940s, of the path-breaking papers of 
Gel’fand and Naimark on Banach algebras, the essential aspects of this 
approach became clearer. This theme was taken up by I. E. Segal in 1947 
(Postulates for general Quantum Mechanics, Ann. Math., 48 (1947), pp. 930- 
948). Segal’s postulates on the observable algebra were simpler and more 
direct than von Neumann’s, and were motivated by what is operationally 
possible; in addition, he studied in detail the structure of the states. 

The case when the (bounded) observables are the self-adjoint elements of 
a complex C*-algebra (a complex Banach algebra with involution * such 
that ||a*a| = ||2||? = ||a*||?, ||1||=1) is a possible system in the sense of Segal. 
Although special, it is an important one since it includes the conventional 
model of Quantum Mechanics. If 4 is the C*-algebra whose self-adjoint 
elements are in correspondence with the bounded observables of the system, 
a state of the system is a linear function f on Uf such that f(1)=1 and 
f(a*¥x)>0 for all veU. The Gel’fand-Naimark-Segal representation 
theorem associates to f an essentially unique triple (#,,77,,,) consisting of 
a Hilbert space H, a unit vector %, in #,, and a *-representation 7, of U 
in #, with , as cyclic vector such that f(x) =(7,(x)p,,,) (ce U); fis pure 
if and only if 7; is irreducible. In this way state vectors are once again unit 
vectors in Hilbert spaces; but the Hilbert space is not fixed as in con- 
ventional Quantum Mechanics, but varies with the state. The importance 
and usefulness of these more general types of observable algebras become 
clear in the theory of Second Quantization where one works with systems 
of identical particles (photons, electrons, He’, He‘, etc.), but where the 
number of particles may not remain fixed. In this case it turns out in 
contrast to the case of systems of a fixed number of particles, that the 
canonical commutation rules (or anti-commutation rules) have many 
essentially different representations. Nevertheless, it was proved by Segal 
(Mat. Fys. Medd. Dan. Vid. Selsk, 31 (1959), No. 2) that the C*-algebra, 
which is the norm-closure of the observables depending only on finitely 
many particles, is canonically determined; therefore it may be regarded as 
the algebra of field observables. The possibility that one may have to vary 
the Hilbert space with the state has also been discussed by Dirac (Lectures 
on Quantum Field Theory, Belfer Graduate School of Science Monograph 
Series, No. 3, Belfer Graduate School of Science, Yeshiwa University, 
New York, 1966). 

The study of infinite dimensional observable algebras has been pursued 
in recent years in another direction, closer to the original theme of Jordan, 
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von Neumann, and Wigner. This is the study of what are called JB, JBW 
algebras, which are the Jordan algebra analogues of the self-adjoint parts of 
C*-algebras and von Neumann algebras. The state spaces of these are 
compact convex sets with an interesting geometric structure, and they can 
be characterized in an intrinsic manner. This thus opens up another 
approach to the foundations in Quantum Mechanics. For a survey of many 
aspects of this theme we refer the reader to the article of E. M. Alfsen (On the 
state spaces of Jordan and C*-algebras, in Algebres d’operateurs et leurs 
applications en Physique Mathematique, CNRS Coll. Intern. No. 274, 1979, 
pp. 15-40). 

In recent years people (Haag, Kastler, and others) have used algebras of 
operators for formulating some general properties of physical systems (for 
example, a quantized field) extended in space-time, that are local in the 
sense that the observables that are localized in noncausally connected 
regions commute with each other. These ideas have clarified somewhat the 
nature of the high-energy scattering processes that dominate elementary 
particle physics. The reader who wants to understand the algebraic 
approach may start with the book of G.G.Emch (Algebraic Methods in 
Statistical Mechanics and Quantum Field Theory, Wiley, New York, 1972), 
and the references cited there. It should be remarked, however, that the 
mathematical questions arising in the attempt to construct rigorous models 
of scattering processes involving particle creation and annihilation 
(Quantum Electrodynamics, for example) are quite deep and are tied up 
with themes of an entirely different sort (cf. J. Glimm and A. Jaffe, Quantum 
Physics: A Functional Integral Point of View, Springer-Verlag, New York, 
1981). 

The approach to Quantum Mechanics via functional integrals was 
pioneered by R.P. Feynman [2] (cf. also his book with A.R. Hibbs, 
Quantum Mechanics and Path Integrals, McGraw-Hill, New York, 1965) 
and is known as the path integral formalism. It is widely used by physicists 
although mathematically difficult to justify. The attempts to understand 
this formalism at a rigorous level have proved very beneficial and effective 
in the mathematical theory of elementary particles and their interactions; 
see the book of Glimm and Jaffe, loc. cit. Finally, for a view of the founda- 
tions of Quantum Theory that is very close to phenomenology one should 
refer to the series of papers of J.Schwinger which are a part of his book 
Quantum Kinematics and Dynamics, W. A. Benjamin, New York, 1970. 


3. Statistics of mixed states. Consider a system governed by the standard 
logic Z(H) where # is a separable complex Hilbert space, and let it be 
in the state py where U is a von Neumann operator on #. By the 
spectral theorem we have U=)),A,P1, where 2,,A,,... are the distinct 
eigenvalues >0 of U, L, the eigensubspace corresponding to Àn, and Pz, 
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is the orthogonal projection #->L,. If d,=dim(L,), Wn =d,A,, and 
U,=d,"P_z,, then tr(U,)=1, 2unWa=1, and the formula 


(23) Pu = ` Wna PUn 
n 


exhibits py as a canonical mixture of the states pu, with weights w,, the 
supports of the pv, being mutually orthogonal. If U has a simple spectrum, 
i.e., if d,=1 for all n, the pu, are all pure states; and (23) is the unique 
decomposition of py as a convex combination of pure states defined by 
orthonormal vectors. If some of the d, are >1, the right side of (23) can be 
split further but not uniquely; indeed, if (Wai), <<a, iS any orthonormal 
basis of L, we have 
(24) Pu=VWn LY An’ Dug 
n 181<dn 
which exhibits py as a convex combination of pure states defined by the 
orthonormal family of vectors (w,,). Thus, as soon as some d, > 1 we cannot 
definitely say of which pure states pv is a mixture of; the phenomenon is 
clearly nonclassical and is another example of the sharp contrast between 
quantum and classical statistics. 


4. Composite systems. One of the most widely used methods of analyzing 
complicated systems is that of viewing them in terms of suitable subsystems 
and their interactions. For example, for many purposes it is completely 
adequate to view an atom as a system of electrons moving in a centrally 
symmetric force field. We are thus led to the general problem of composition 
of systems. Let ©, G,,...,Gy be quantum mechanical systems; in con- 
ventional Quantum Mechanics we associate complex separable Hilbert 
spaces #,...,#n, to them with the usual prescriptions of states and 
observables. It is then possible to form a composite system © containing 
©; as subsystems in a natural manner, and in fact to do this in a universal 
way. The logic of © is #(#) where 


KH =H 1®...@HN. 
The map 


ty: 471®@...@A®...@1 (1 <a<N,A in the ath factor) 


imbeds the algebra B(#,) in B(#); by restricting the A’s to the pro- 
jection operators, each i, gives rise to an imbedding 


i: LIL) S LA) (1<a<Q). 


Thus the properties of the ©, are faithfully reflected within G. If ø is a state 
of G, i.e., a probability measure on L (X), then 


(25) O, = 001z 
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is a probability measure on S (Ha) and so defines a state of ©, the state 
that Š, is in when GS is in the state o. The map 


(26) o —> (T1, ON) 


is clearly surjective; if o,’=pu, where U, is a von Neumann operator on 
A, and r=pu where U=U,®...@Uy, then roi, =o,’ for all a. In general 
the o, do not determine o uniquely; it can be proved that this is so 
if and only if at most one of the o, is nonpure (cf. von Neumann [1], 
pp. 425-429); in particular, if u,e W, is a unit vector, o,=p,,, and 
U=U,®...@Uy, Pu is the unique state of (W) such that p,ot,=p,, for 
all a. However, if we # is a unit vector which is not in this “ split” form, 
some of the states p, 7, will not be pure, thereby illustrating the difference 
between the quantum and classical compositions. 

As a simple example let N =2, #,=#, = C*; we can think of each ©; as 
a pure spin system of spin 3. If {y+,~-} is an orthonormal basis in C2 which 
corresponds to the + 1 values of the spin in some (fixed) direction in space, 


(27) p = -p(t O@r-9 gt) 


represents the so-called “spin singlet” state of the particle pair. The states 
O, = Po ° la (a=1,2) are then both mixed, being equal to 4(Pp+ +p -). The 
probability of having spin parallel to any given direction is 4 for each of the 
particles; but the spins are so correlated that, given that the first particle 
has spin along the direction a, the probability is 1 that the second particle 
has spin along —d@. This example is a variant due to Bohm and Aharanov 
(Phys. Rev., 108 (1957), p. 1070) of the example used by Einstein, Podolsky, 
and Rosen in their famous paper (Phys. Rev., 47 (1935), pp. 777-780). By 
measuring the spin of the first particle in any direction a we can predict 
with certainty the spin of the particle along & without apparently dis- 
turbing its state, a paradoxical result since the spin components along 
different directions are not simultaneously observable. The paradox arises 
because of the assumption (which ultimately involves preconceptions about 
the structure of matter, among other things) that a measurement of the 
spin state of the first particle does not affect the state of the second when 
they are spatially far apart; the paradox disappears if one realizes that 
when the composite system is in a state such as (27), a measurement of the 
spin state of the first particle changes the state of the composite system, and 
consequently also the state of the second system (cf. the detailed analysis 
of the situation in Bohr’s reply to the EPR paper, Phys. Rev., 48 (1935), 
pp. 696-702). 

The theory of composition of systems in contexts other than the standard 
ones is more involved. For instance, since there is no reasonable way to 
form tensor products of vector spaces over noncommutative division rings, 
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the theory of composite systems in quaternion Quantum Mechanics 
becomes more difficult. For quaternionic Quantum Mechanics, the im- 
possibility of forming tensor products is interpreted by Finkelstein et al. [1] 
to mean that there are no truly independent systems, i.e., that there are 
complementarity relations between observables of any two systems in 
quaternionic Quantum Mechanics. The issue becomes even more compli- 
cated when we treat it in the framework of general logics. The reader is 
referred to the discussion in Chapter 24 of the book of Beltrametti and 
Cassinelli (cf. Notes on Chapter ITI) and to the literature cited there. 

The above considerations have to be modified substantially when une 
wants to treat systems of identical particles, because, in Quantum Mechanics 
there are no experiments that can keep track of individual particles 
throughout the course of some physical process. Thus, in such assemblies of 
identical particles, only the physical quantities that involve the various 
particles symmetrically are permitted to be observables. Mathematically, 
this means the following: if # is the Hilbert space of a single particle, then 
for a system of N such particles, the self-adjoint operators on 


HAH = HKO... OQR (N times) 
that represent physical observables must commute with the natural action 
of the permutation group Sy of {1,..., N} on 4); here we recall that any 
o € Sy acts on #) by sending w ®...® un to 


Ug—1(1) @ «++ @ g-un (Uj EH). 
Let U be the algebra of all operators on #7“) commuting with this action on 
Sy. To analyze the structure of U we proceed as follows. For any irreducible 
character 7 of Sy let E(7) be the projection operator 


(N !sdim(7)) © z(s)s on #0) 


seSy 
that projects onto the space of “‘r-symmetric” tensors; if 
Hr) = Els) (£), 
the #7) are mutually orthogonal, stable under Sy and Uf with 
(28) HM = (PHN); 


moreover, we can factorize each #™)(r) (but noncanonically), as 
(29) HMT) x Kr) @ Liz), 


where lf (resp. Sy) acts only on the first (resp. second) factor in such a way 
that Sy acts irreducibly on L(r) with character r while W is isomorphic to 
B(K(r))@1 (see Wey] [1]). Thus, if # is the logic of Sy-stable closed sub- 
spaces of #™), then the #)(r) are the atoms of the center of Y, the logics 
L(r)=LOAN\r7) are standard (being isomorphic to £(K(7)), and 
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L=H,L(r) is the central decomposition of Z. Although (29) is not 
canonical, once we fix such a factorization, for each 7, the pure states of Z 
may be identified with the rays of #™) that lie in some K(z), so that the 
K(r) are the coherent subspaces of pure states. The reader should view this 
as an example of noncommuting superselection rules. 

Two specific coherent spaces of pure states deserve special attention. 
These are the cases where r is one-dimensional so that K(r)=#™(r). More 
precisely, we consider the two cases: (a) r=1, 7(s)=1 for all seSy; 
(b) 7 =sign, 7(s) =signature of s for all s e Sy. In (a), #7) is the subspace 
of symmetric tensors ofrank N over #, or the subspace of degree N, SN HW), 
in the symmetric algebra S(#) over H. In (b), (7) is the subspace of 
skew-symmetric or alternating tensors of rank N over #, or the subspace of 
degree N, AW), in the Grassmann or Exterior algebra A(#’) over #. 
In atomic theory, in order to get agreement with experiment it is necessary 
to assume that for systems of N electrons no states other than those in 
A“) (3) occur in nature, i.e., the logic is the standard one associated with 
the Hilbert space AM( Æ). If u; (1 <i <N) are orthonormal vectors in #, 
it is customary to interpret u,A...Auwn as “the state in which the electrons 
(in some order) occupy the states u,,...,un’’. The fact that u,,A...Au,,=0 
if two of the 1,’s are identical is usually formulated as the Pauli exclusion 
principle, namely, “no two electrons can be in the same state.” In radiation 
theory which studies photon assemblies it becomes necessary to assume that 
for a system of N photons only the symmetric tensor states occur in nature, 
i.e., the logic is the standard one associated with the Hilbert space S")(#). 
In either case it is necessary to give additional prescriptions, typically by 
introducing specific operators, to describe physical quantities and processes 
characteristic of the systems of particles in question. The systematic way of 
doing this is known as Second Quantization. 

In particle physics it is assumed that the above description in terms of 
AW) #) or SH) extends to assemblies of particles of any type whatso- 
ever. Particles which require the exterior algebra description are known as 
Fermions and are said to obey Fermi—Dirac statistics, while the others are 
known as Bosons and obey Bose-Einstein statistics. Experiment has shown 
that particles with half integral spin (electrons, neutrons, protons, He? 
atom, etc.) are Fermions while particles with integral spin (photons, He‘ 
atom) are Bosons. All theoretical explanations of this remarkable connec- 
tion between spin and statistics depend on the theory of relativistic 
quantum fields. 


5. Measurement in Quantum Mechanics. The theory of measurement is at 
the heart of Quantum Mechanics. Indeed it is the uncontrollable disturbance 
caused by measurements on the physical systems being observed that leads 
to the statistical character of the quantum mechanical picture of the 
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physical world. It is natural to try to determine the extent to which one can 
understand the nature of the measurement process within the framework of 
Quantum Mechanics itself. Of course Quantum Mechanics gives unam- 
biguous prescriptions for calculating the probability distribution of any 
physical quantity at a given instant of time, if the state of the physical 
system is given at an earlier time (Schrédinger’s time-development 
equation), so that, no theory of measurement is needed to ensure the 
logical consistency of the theory. But the state vectors are not physical 
objects; they are idealizations, and one has conceptual access to them only 
through results of experiments. Thus, if the theory is to be confronted with 
experience, and is to be used as a guide for understanding existing phenom- 
ena as well as for predicting new ones, it has to contain instructions that 
tell us how to determine the states from physical data, i.e., how measure- 
ments on a system change its state. 

The fundamental work on the general theory of measurement is that of 
von Neumann, most of which is contained in Chapters V and VI of his 
book [1]. Since then, the subject has received a lot of attention and new 
themes have been introduced. We shall limit ourselves to a brief discussion of 
von Neumann’s work and some of the later developments inspired by it. 
The reader who wants to get a historical perspective and a deeper insight 
into these questions should refer to the volume edited by Wheeler and Zurek 
(Quantum Theory and Measurement, Princeton University Press, Princeton, 
NJ, 1983) that collects together most of the crucial papers on the subject. 

For simplicity we shall restrict ourselves to the conventional model of 
Quantum Mechanics. Let a quantum mechanical system be in a pure state 
represented by the normalized vector 9 (of the Hilbert space # underlying 
the system), and let A be a self-adjoint operator with a purely discrete 
simple spectrum that represents some observable. If a,,a,,... are the 
eigenvalues of A and 9,,9,... are the corresponding normalized eigen- 
vectors, then a measurement (instantaneous) of A will yield the values 
a1, az... with probabilities |(p,7,)|*, |(~,2)|?, .-. However, if a particular 
value, say a@,, is obtained, and the measurement of A is again carried out 
immediately after, the value does not change. This is interpreted in the 
following way: the act of measurement has changed the state of the system 
from 9 to pr. Since the value a, is obtainable only with probability |(9,9,)|?, 
one may then summarize this set of circumstances by saying that the 
statistical effect of the measurement of A consists in changing the state of 
the system from the pure state g to the mized state in which the pure states 
Pis Pa- occur with probabilities |(p,7,)|*, |(~,y2)|?, .... If the initial state is 
not pure but mixed and U is the von Neumann operator representing it, the 
operator of the changed state is 
(30) Ua= 2 (UPn Pn) Pron 


n> 
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where Piy, is the orthogonal projection on the one-dimensional span of ¢,, 
(for the discussion of the Compton—-Simons experiment of light scattering 
by electrons on which this assumption is based, see von Neumann [1], 
pp. 212-214). Thus the act of measuring A induces the map 


(31) Ma:U->U4 


on the convex set of states. It is clear from (30) that M 4 is a convex map; 
but it is a convex endomorphism and not a convex automorphism since in 
general it changes pure states into mixed ones. The essential content of von 
Neumann’s work is a profound analysis of the mathematical and physical 
nature of the endomorphisms of the type M a. 

The first question is to what extent M 4 fails to be invertible. Von 
Neumann viewed this from the more general point of view of deciding 
whether (the effect of) M 4 is irreversible. To this end, he introduced also 
the automorphisms of the convex set of states of the form 


(32) ap: U > DU D-t; 


these are induced by unitary operators D of # that represent the dynamical 
transformations. He then formulated the irreversibility of M 4 as the fact 
that it is impossible to go from M 4(U) to U by repeated applications of 
transformations of the form M Bg, ap. For proving this he introduced, for 
any U, its entropy H(U): 


(33) H(U) = —xtr(UInU), x > 0 being Boltzmann’s constant. 
Then he proved that 


(34) A(U) 20, H(U)=0<U =P, forsome gez with |p| = 1, 
and further that 

(35) H(ap(U)) = H(U), 

(36) H(M4(U))> H(U) unless Ma(U) = U. 


The irreversibility of the transition U > U , follows from (35), (36). 

As remarked by von Neumann himself, there are many unitary invariants 
besides H (U) that do not decrease under the measurement transition U >U 4. 
For instance, the function that assigns to U the value —A where A is the 
largest eigenvalue of U, has this property. But as we shall explain briefly 
below, the entropy function H has thermodynamic significance; and the 
relations (35)-(36) show that the measurement transition U>U, is 
actually irreversible even at the thermodynamic level. Since it is easy to 
construct examples of changes U -> U’ where the entropy stays the same 
but the largest eigenvalue decreases strictly, we see that there are thermo- 
dynamically reversible changes which cannot be brought about by repeated 
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applications of M B, ap. Thus the inequalities (36) go far beyond merely 
establishing the mathematical irreversibility of the transitions U > U 4. 

Von Neumann was led to the definition (33) of entropy by his analysis of 
ensembles of noninteracting systems each of which isin the state represented 
by the (von Neumann) operator U. He used the methods of phenomeno- 
logical thermodynamics for his purpose. The only difference between the 
classical setting and the present one is that the individual systems are now 
governed by the laws of Quantum Mechanics. The problem was to calculate, 
for an ensemble of N systems in the state U, the decrease in entropy that 
should be achieved to bring the systems to a pure state p,. He found that 
the pure states are transformable among themselves without any change of 
“heat energy” and so all ensembles where the individual systems are in a 
pure state p, have the same entropy, which can be normalized to be 0. He 
then showed, assuming the validity of the first and second laws of thermo- 
dynamics, that the ensemble of N systems with state U requires a decrease 
in entropy equal to —Nx«tr(UInU), k being Boltzmann’s constant, in 
order that it may be transformed to the pure state p,. Thus the entropy of 
the original ensemble is 


(37) ~Nxtr(UInU). 


The reversal of the processes used here shows that two ensembles with the 
same entropy can be transformed into each other by thermodynamically 
admissible processes. This analysis, which led to the formula (37) for the 
entropy, also suggested the properties (33)—(36). 

The relation (35) asserts that the entropy remains constant during time 
evolution if no measurements are carried out. This will appear paradoxical 
because classically the entropy always increases. The paradox disappears if 
we realize that the entropy H(U) is the entropy of the microstate. The 
classical entropy is a macroscopic quantity and its time variations are due 
to the fact that the observer does not know the microstate of the system. 
To define an analogue of the classical entropy one may introduce a Boolean 
o-algebra IN of “macroscopic projections” that form the logic of macro- 
scopic quantities. If we assume that Jt is atomic and its atoms FH), Ez, ... 
are finite dimensional projections where 


(38) Sn = dim(H,) > 1, 
then it is reasonable to define the macroscopic entropy Hoy(U) of the state 
U by 


(39) H(U) = -Z tr(UB,) n=), 


Von Neumann proved that 
(40) Ho(U) > H(U), 
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with equality if and only if 
(41) U => (tr(UL,)/8n) En. 
n 


Unlike the previous case, the macroscopic entropy is always >0 for any U. 
Moreover, it will change under time evolution. Von Neumann investigated 
this as well as the associated ergodic theorems in his paper in Z. Physik, 57 
(1929), pp. 30-70, to which we refer the reader for further details. 

The second aspect of the work of von Neumann is the investigation of 
the maps M 4 from the point of view of the Quantum Mechanics of the 
composite system formed by the system © under investigation and the 
system G’ consisting of the “measuring apparatus.” Let # and #’ be the 
Hilbert spaces associated with them and let Æ =H OX". Let A be as 
before a self-adjoint operator with a purely discrete spectrum, @,,dp,..., 
Pis Pg,--- being the corresponding eigenvectors (normalized). The act of 
measurement of A in © may be regarded as a “temporary” insertion of an 
energy coupling in the composite system that allows ©’ to interact with ©. 
The time development of the composite system in the (short) time interval 
of measurement is then given by the dynamical group t-> exp(—7#) where 
E is the energy operator in #. If é is a unit vector in #’ representing the 
initial state of the measuring apparatus, and is a unit vector in # repre- 
senting the initial state of ©, the state of the composite system after 
measurement is exp(—%7H)(y@&) where 7>0 is the duration of measure- 
ment. This state induces a state of G which may then be regarded as the 
state of S after measuring A. To make sure that this scheme captures at least 
some of the key aspects of measurement it is natural to require that the 
following conditions be satisfied. 


(i) There is an orthonormal set (€,),>, in #’ such that the state €, 
of G’ “corresponds” to the state 9, of G. 


(ii) For all pe, 
exp(— 17H) (9 @ E) = X calp) Pn @ En, 
with ” 
|enlp)| = |(P:Pn)]. 


The pairing 9, + £n is the mechanism by which the observer, recording the 
state £, of ©’, recognizes that © is in the state y,. The second condition 
shows at once that the state of S induced by exp(—17£)(p@6) is 


py (P,P) |? Py, = M Alpo). 


If for a given A we can find (€,)a>,, €, E, 7 such that (i) and (ii) are 
satisfied, then it would be justifiable to view the transitions U > M 4(U) 
of the states of G, which are irreversible within G, as the effects on © of the 
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perfectly reversible and continuous transitions provided in the composite 
system by its time development. In other words, the causal and continuous 
change of states arising out of the dynamical evolution gives one a unified 
quantum mechanical way of looking at the physical world and its inter- 
actions with the observer. Von Neumann proved that this is possible; that 
is, for any A, and for fixed 7>0, £, (En)n>1, there is a suitable E satisfying 
(i) and (ii) above. 

This result of von Neumann inspired much subsequent work whose focus 
was on the following issue: To what extent do the requirements (i) and (ii) 
above as well as the operator Æ constructed by von Neumann give a faithful 
picture of measurement processes encountered in the physical world? It 
could be argued that the description of ©’ by the Quantum Mechanics of 
the logic L(I’) does not take into account the fact that very often ©’ is 
“very big” compared to © and so the logic of G’ has a rich supply of super- 
selection rules. The requirement that measurement is of very short duration 
is also troublesome if these questions are to be treated relativistically. 
Finally, if there is additional structure in #, #’, and H , it is not clear that 
the Æ constructed would respect them. The first example of nonmeasur- 
ability of this kind was exhibited by E. Wigner (Z. Physik, 131 (1952), 
p. 101). He showed that if there are additive conservation laws for E, then 
A cannot be measured unless it commutes with the conserved quantity; 
and further, that for approximate measurements of A with arbitrary 
accuracy to be possible, the measuring apparatus should be large enough 
to admit states which are superpositions of sufficiently many states with 
different quantum numbers of the conserved quantity. Actually for 
Wigner, A was the z-composnent of the spin of a spin } particle, the con- 
served quantity being the z-component of the angular momentum. These 
results were proved in full generality by H. Araki and M.Yanase (Phys. 
Rev. , 120 (1960), pp. 622-626). They proved that if L, (resp. L}) isa bounded 
self-adjoint operator on # (resp. #’), and if r>0, vectors (£,) n>, that are 
orthonormal in #’, and a self-adjoint operator E on # can all be found 
such that U =exp(—77E) satisfies the von Neumann criteria for measure- 
ment of A for some initial state € in #’, then for U to commute with 
L=L,@1+1@L, it is necessary that A commutes with L,. Their work also 
showed that when # and #’ are finite dimensional and the cardinality of 
the spectrum of L, is allowed to be sufficiently large, then approximate 
measurements of A (in a certain sense) can be made with any desired 
accuracy. This raises the question, when A, é, and #’ are fixed, of ex- 
plicitly calculating the lower bound of the accuracy of approximate measure- 
ments of A in terms of the variance of L, in the state £. For spin measure- 
ments of spin } particles(when the conserved quantity is the z-component of 
the angular momentum), M. Yanase (Phys. Rev., 123 (1961), pp. 666-668) 
has investigated this question. 
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6. Quantum Probability Theory. For a given logic Z the notions of 
probability measures on ¥ and observables associated to Y define a non- 
commutative generalization of conventional probability theory. For 
standard logics one can develop such a generalization in great depth: see for 
instance the articles in Quantum Probability and Applications to the Quantum 
Theory of Irreversible Processes, Springer Lecture Notes in Mathematics, 
1055 (1984), edited by L. Accardi, L. Frigerio and V. Gorini. 

One of the concepts that is needed for such a theory of Quantum 
Probability, and which goes beyond what we have treated in Chapters III 
and IV is that of conditional probability. Let # be a complex separable 
Hilbert space and 2 = L (H). Suppose p is a probability measure on Z and 
U the associated von Neumann operator. We identify elements of Z with 
the orthogonal projections corresponding to them. If Æ is a projection and 
p(£)>0, the conditional probability measure given that E has occurred is 
defined as the probability measure p (.|H) given by 


(42) p(F|E) = tr(EUEF)/tr(EU E). 


It is not difficult to show that p(.|ÆE) is the unique probability measure on 
¥ with the following property: to any F <Æ, it assigns the probability 
p(F)/p(E). Clearly p(. |E) is defined by the von Neumann operator V where 


(43) V = (tr(UE))- HUE. 
If F is a projection commuting with E we have 
(44) p(T |E) = p(EF)/p(E), 


which is exactly as in the conventional probability theory; however, for 
arbitrary F we should use (42). If p=py where pe # and ||p||=1, we have 


(45) p- |E) =P, Y= |El: Eg. 


Let E = E,+E,+... be an orthogonal sum of projections E}, E, ... 
with p(E a) >0 for all n. If the projection F commutes with all the Z,, we 
have 


(46) p(F|E) = x (p(F|E,) p(L,)/p(2)). 


However, for general F, (46) is no longer true. If p=p, (pe, loll =1) and 
%,=||£,9|-!-H,9, then for an arbitrary projection F we have the correct 
formula 


_ y [IE [Bal B01 
UN oF IB) = E (FE acre) + g (Eel) ry. da) 


The first term in the right side of this formula is the same as that in the right 
side of (46). The additional terms in the right side of (47) may be viewed as 
arising out of the “interference effects” that are typical in quantum theory 
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when F does not commute with the E „. For a very interesting discussion of 
the well-known two-slit experiments of Quantum Mechanics from the point 
of view of conditional probability see Chapter 26 of the book by E. Beltra- 
metti and G. Cassinelli (cf. Notes on Chapter III). We refer to the same place 
for a treatment of conditional probabilities relative to a Boolean sub- 
o-algebra of Z, and, more generally, relative to a sublogic of Z. 


7. Measures on the projections of avon Neumann algebra: Generalizations of 
Gleason’s theorem. Gleason’s theorem may be viewed as a complete descrip- 
tion of all countably additive probability measures on the standard logic 
L(A) when dim(#) > 3. It is natural to ask whether similar results can be 
proved for probability measures on the logic of projections of general von 
Neumann algebras. This question was first raised by G. W. Mackey (Amer. 
Math. Monthly, 64 (1957), pp. 45-57), and it appears that it has now been 
settled completely. 

Let # be a complex separable Hilbert space and y a von Neumann 
algebra of bounded operators on #. We write. P for the logic of projections 
in £ and .* for the real vector space of all self-adjoint elements of... By 
a finitely additive probability measure on. P we mean a map p: £ P -> [0,1] 
such that u(0)=0, p(1)=1, n(P+Q)=p(P)+u(Q) for P, Qe? with 
PQ=0. Given such ap, one can associate to it the corresponding expectation 
functional 

E: ->R 
in the following way: if A e £+ and P4(E —> Pg) is the spectral measure of 
A on the o-algebra & of Borel subsets of R, then 


(48) E (4) = F, rdv(A), 


where v is the finitely additive probability measure H+p(P,) on Z. It is 
easy to see that the correspondence 


>E, 


is bijective from the set of all finitely additive probability measures on. P 
to the set of all functionals 


E: 4 —>R 
with the following properties: 
(49) (i) Æ is linear (over R) on any abelian subalgebra of *; 
(ii) H(A?) > 0 for any Ac; 
(iii) (1) = 1. 


Such functionals are called physical states of x. The concept of a physical 
state is in principle weaker than that of a state of. which demands linearity 
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in place of (i) while retaining (ii) and (iii). If. =B(#), Gleason’s theorem 
implies that for any countably additive u, FE, is a state of B(#). The 
problem of generalizing Gleason’s theorem on £P may then be split into 
two parts: 


(1) to determine for which ¥ all physical states are states; 

(2) if a state of is normal, i.e., if it is countably additive on mutually 
orthogonal projections, to obtain representations of it in terms of the 
usual trace on. (whenever such a trace exists). 


The first problem focuses directly on the issues arising out of the non- 
commutativity of the algebra of observables, and was recognized from the 
beginning to have crucial significance from the foundational point of view. 
Since the linearity of physical states is no longer true when dim(#) =2 it is 
clear that we must assume that Z has no central direct summand of 
type I,. Recent work seems to have now shown that for such an £, all its 
physical states are actually states. This result, certainly one of the most 
fundamental and profound ones in the subject, is the culmination of the 
efforts and contributions of many people spread over several years. In 
addition to the famous paper of Gleason the following (partial) list of 
articles may be cited: J. F. Aaarnes, Trans. Amer. Math. Soc. 149, (1970), 
pp. 601-625; J.Gunson, Ann. Inst. H. Poincaré, A 17 (1972), pp. 295-311; 
A.A. Lodkin, Funk. Anal. Priloz., 8 (1974), pp. 54-58; M.S. Matvieicuk, 
Funk. Anal. Priloz., 15 (1981), pp. 41-53, and Teor. Mat. Fiz., 48 (1981), 
pp. 261-265; E. Christensen, Comm. Math. Phys. 86 (1982), pp. 529-538; 
F.J. Yeadon, Bull. London Math. Soc., 15 (1983), pp. 139-145, and Bull. 
London Math. Soc., 16 (1984), pp. 145-150; and A. Paszkiewicz, preprint, 
to appear in J. Funct. Anal. For a survey of these and other relevant articles 
and a discussion of the ideas used in the proof we refer to the article of 
P. Kruszyński: Extensions of Gleason’s theorem, in Quantum Probability and 
Applications to the Quantum Theory of Irreversible Processes, Springer 
Lecture Notes in Mathematics, 1055 (1984), pp. 210-227, edited by 
L. Accardi, A. Frigerio and V. Gorini. 

The second problem is really part of the study of normal states, and may 
be viewed as a type of Radon—Nikodym theorem. Such theorems go back 
to H. A. Dye (Trans. Amer. Math. Soc., 72 (1952), pp. 243-280), and the 
reader should consult the literature on von Neumann algebras for definitive 
formulations of such results. 


CHAPTER V 
MEASURE THEORY ON G-SPACES 


1. BOREL SPACES AND BOREL MAPS 


From this point onwards we shall emphasize some of the more sophisti- 
cated and specialized aspects of quantum theory. We shall deal only with 
complex separable Hilbert spaces, referring to them as Hilbert spaces 
without any qualification. 

We pointed out in Chapter III the role played by representations of 
physical symmetry groups into the groups Aut( S) and Aut( S), where 
Ẹ is the logic and J is the state space of some quantum mechanical 
system. If we assume that Y is standard, then the results of Chapter IV 
describe rather completely the structure of # and Z. It would thus 
appear quite feasible to study the representations of the groups which 
are important from the physical point of view. Such a study would 
shed considerable light on various aspects of quantum theory. Now, 
the methods used in the theory of representations of groups are 
very sophisticated and depend heavily on analysis on homogeneous 
spaces. The object of this chapter is to present the basic mathematical 
theory of homogeneous spaces and the function spaces associated with 
them. 

The main reference to the theory of measure and integration on groups 
and homogeneous spaces is the book of A. Weil [1]. This however deals 
mostly with compact and abelian groups. The general theory which is 
presented in the following sections is to a very large extent the work of 
Mackey [2], [3], [4], [5], [6], [7]. 

We shall begin with the concept of a Borel space. A Borel structure on 
a set X is simply a o-algebra Z of subsets of X; the pair (X,#) will be 
referred to as a Borel space. Elements of Z will be referred to as the 
Borel subsets of X. We shall use this terminology only when there is no 
ambiguity about the o-algebra involved. By the usual ahuse of language 
we shall refer to X itself as a Borel space. If (X,#) and (Y,@) are Borel 
spaces, and f is a map of X into Y, f is said to be Borel if f-'\(@)c@;, if f 
is one-one, maps X onto Y, and if f~1(@)=&, we shall say that f is a 
Borel isomorphism, and speak of X and Y as being isomorphic. When 


X= Y and f is an isomorphism we shall call f an automorphism. 
148 
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The theory of Borel spaces is a very extensive one. We shall constantly 
use the main results of this theory in our work. For this, our references 
are to Halmos [1], Kuratowski [1], [2]. Articles of Mackey [5] and Black- 
well [1] are also very close to our general point of view. In the next few 
paragraphs we shall summarize what is essential for our purposes; proofs 
of these and other related results may be found in the sources mentioned 
above; cf. also Varadarajan [3], [4]. 

A Borel space (X,#) is said to be separable if (i) {x} e Z for all re X 
and (ii) there is a countable set oF such that Z is the smallest o- 
algebra of subsets of X containing 2; 2 is then said to be a generating 
class. A class 2c is said to be separating if, for each pair of points of X, 
there exists a set A e 9 which contains one but not both of them. It is 
useful to note that in any Borel space (X,#), if D generates Z and 
{x} e Z for all x e X, then 2 is separating. In fact, if this is not true, there 
will be a pair of points x, y e X (xy) such that for each A e 2, either 
{x,y$SA or {x,y} N A= ø. The set of all such A is a o-algebra containing 
2 and hence every set in Z has the above property. But this is impossible 
as {x} does not have the property. 

Let X be a topological space. The smallest Borel structure containing 
all the open subsets of X is called the natural Borel structure of X. When 
X is a metric space, the natural Borel structure coincides with the smallest 
Borel structure with respect to which all continuous functions on X are 
Borel. 

Suppose X, (œx EJ) are Borel spaces and X any set. For each a eJ let 
Ta be a map of X into X,. Then there exists a unique smallest Borel 
structure on X with respect to which all the maps z, are Borel. If Y is a 
Borel space and f a mapping of Y into X, f is Borel if and only if 7, o f is 
Borel for each a. If X is the product space of the X, and mą the projection 
of X on X,, X, equipped with the above mentioned Borel structure, will be 
called the product of the Borel spaces X,. If each X, is a topological space 
and its Borel structure is the natural one, then it is not in general true that 
the product Borel structure coincides with the natural Borel structure 
associated with the product topology on X. This is so, however, if J is 
countable and each X, has a second countable metrizable topology. 

If (X,@) is a Borel space and Y is a subset of X, then the class Zy 
defined by 


By ={BAY: Be 


is a Borel structure for Y. (Y,#,) is said to be the Borel subspace defined 
by Y. The natural injection of Y into X is Borel. Note that Y itself need 
not be a Borel set in X. If X is a topological space with its natural Borel 
structure, then y coincides with the natural Borel structure of Y when 
Y is considered as a topological space with the relative topology. 
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An important class of Borel spaces may be singled out in the following 
way. Let T be a complete separable metric space and X a Borel subset of 
T. Then the Borel subspace defined by X is said to be standard. The 
motivation for this terminology rests on the remarkable theorem that two 
standard Borel spaces are isomorphic if and only if they have the same 
cardinal number and that a standard Borel space is finite, countable, or 
has the power of the continuum. Standard Borel spaces are separable. If 
(X,Z) is a standard Borel space, ( Y,@) is separable, and if f is any one-one 
Borel map of X into Y, then (i) the range Y, of f is a Borel set in Y, (ii) 
the Borel subspace defined by Y, is standard, and (iii) f is a Borel iso- 
morphism of X with Y,. If (X,,4,) (¢=1,2,---) is a sequence of standard 
Borel spaces, then their product is standard. A Borel subspace of a 
standard Borel space is standard if and only if it is defined by a Borel set. 
Any uncountable standard Borel space is isomorphic to the unit interval. 
If X is any locally compact Hausdorff space satisfying the second axiom 
of countability, the Borel space obtained by equipping X with its natural 
Borel structure is standard. 

A measure on a Borel space is any nonnegative set function which is 
countably additive on its Borel structure; +00 is an allowed value. A 
measure p on (X,#) is finite if p(X) < œ; is o-finite if X =|), Xn, where 
each X, is a Borel set and for each n, p(X,,) < œ. p is said to be standard 
if there exists a Borel set X, such that (i) p(X —X,)=0 and (ii) the Borel 
subspace defined by X, is standard. Let X be a locally compact Hausdorff 
space satisfying the second axiom of countability. By a Borel measure on X 
we shall understand a measure with the property that the measure of any 
compact set is finite. Any Borel measure is o-finite. Let C,(X ) be the linear 
space of all complex valued continuous functions on X with compact 
support and let p be any Borel measure on X. Then 


pif>] fau  (feClX)) 


is a linear functional on C,(X) such that f(f)=0 whenever f is real and 
>0. Conversely, if A(f — A(f)) is a linear functional on C,(X) such that 
A(f)>0O whenever f is real and 20, there exists a unique Borel measure u 
on X such that A=,. This is the well known Riesz theorem (cf. Halmos 
[1], pp. 216-249). Any Borel measure on X is regular, i.e., for any Borel 
set ESX, 
(1) p(E) = sup pC). 
C compact 

Given a Borel space (X,#) and a o-finite measure p on it, we call a set 
ASX p-measurable if there exist Borel sets B, and B, such that 
B SAS B, and p(B,—B,)=0. When there is no ambiguity about the 
measure that is involved, we speak simply of measurable sets. The 
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collection of p-measurable sets is a o-algebra and hence defines a Borel 
structure on X, say By; called the p-completion of Z. The measure p has a 
unique extension to #, as a measure. The functions which are Borel with 


respect to the Borel structure Z, are called p-measurable. If f is any 
p-measurable function on X, there exists a Borel function g such that 


f=g almost everywhere, i.e., {x : f(x) Ag(x)} € B, and has p-measure zero. 

Let (X,¥) be a Borel space, let q be any o-finite measure, and let p be a 
measure. We shall say that p is absolutely continuous with respect 
to q, p<g in symbols, if p(#)=0 for any Borel set E for which 9g(#)=0. 


If f is a nonnegative Borel function, the function p : E —> Í , Jag is a 


o-finite measure which is «q; p is finite if and only if f is an element of 
F(q). Conversely, if p is a o-finite measure «q, there exists a nonnegative 


Borel function f on X such that p(E)= le fdq for all Borel sets Æ. f is 
essentially uniquely determined by p in the sense that if f’ is a Borel 
function and p(H#)= f f'dq for all Borel sets E, then f=f'" q-almost 


everywhere. f is called a Radon-Nikodym derivative of p with respect to q 
and is denoted by dpjdq. If p, q, r are o-finite measures, then p«q and 
q<«r implies p«r and the relation dp/dr=dp/dq. dq|dr holds r-almost 
everywhere. In particular, if p«q and g<p, p and q have the same null 
sets and dp/dq-dq/dp=1 almost everywhere; both the derivatives are then 
positive almost everywhere. Let p be any o-finite measure on X, and let us 
choose disjoint Borel sets X,,X.,:-- such that X=), Xn and p(X,)<0o 
for all n. Let the constants c,>0 be such that >, c,p(X,) <20. Then, the 
measure q defined by 


qE) = > onp(E N Xa) (He) 


is finite, and p and q are mutually absolutely continuous. The relation of 
mutual absolute continuity is obviously an equivalence relation in the set 
of all o-finite measures on X. The corresponding equivalence classes are 
called measure classes on X. If t is a Borel automorphism of X and p is a 
o-finite measure on X, we define the o-finite measure p* by 


(2) P(E) = p(t-"(#)) (HE) 


p is said to be invariant if pt =p; p is said to be quasi invariant if p' and p 
are mutually absolutely continuous. Clearly p is quasi invariant if and 
only if the class of null sets of p is invariant under t. A measure class is 
said to be t-invariant if it contains a o-finite measure quasi invariant with 
respect to t. 

We shall now prove two lemmas of a technical nature. They are of 
importance in the sequel. 
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Lemma 5.1 (Federer-Morse [1]). Let X and Y be two mectric spaces 
and let f be a map of X onto Y. Suppose that there exists a sequence {K,,} 
of compact subsets of X such that (i) K SKS», (ii) X=U, Kn, and 
(iii) for each n, the restriction of f to K, 1s a continuous map of K, into Y. 
Then there exists a Borel set ESX such that (i) f maps E onto Y, 
(ii) for any ye Y, E meets f-*({y}) in exactly one point, and (iii) 
Ef *(f[4,))S&,, for all n. 


Proof. We shall first consider the special case when X itself is compact 
and f is a continuous map of X onto Y. By a theorem of Alexandroff 
(cf. Kelley [1], Chapter 5), the space X is a continuous image of a compact 
set C<[0,1]. Let g be a continuous map of C onto X and let h=f og. Then 
h is a continuous map of C onto Y. For any y € Y, h~1({y}) is a nonempty 
compact subset of C and hence has a unique least element m(y), i.e., 


my) € h-*({y}) and m(y) <c for all c e h-*({y}) —{m(y)}. Define 


F = {m(y): ye Y} 
and 
E = g[F]. 


By its definition, F meets each set h~1({y}) exactly once. Hence h is 
one-one and maps F onto Y. From this it follows immediately that f is 
one-one on E and maps E onto Y. In other words, E meets each set 
f(y) (y e Y) exactly once. 


It remains to prove that E is a Borel set. For any integer n> 1, let 
A,<C be defined by 


A, = {c:ceC,h(d) # h(c) for any deC with d < c—1/n}. 


We claim that A, is open in C. Suppose this is not true. Then there exists 
a sequence c, in C such that c, ¢ A, for all s=1,2,---, but c, —> c e Á, as 
s —> œ. Since c, ¢ A, there exists ac,’ <c,—(1/n) in C such that h(c,’) =h(c,). 
If s;<s,<--- is a sequence of integers such that c,’ > c’ as k —> œ, then 
ce EC, c <c—(1/n), and h(c’)=h(c), a contradiction. A, is therefore open. 

We show next that F=(), A,. If ce F, it follows from the definition 
of m(y) that for any c’<c, h(c’)#h(c). Thus c e A, for all n. Conversely, 
let cy E€ A, for all n. If c’ <co, there exists an n such that c’ <cy—(1/n) and 
hence h(c’)#h(c). This proves that co = m(h(co)). 

Now any open set in a metric space is a countable union of closed sets. 
As C is compact, it follows that each A, is a countable union of compact 
sets. Therefore g[A,] also has the same property, and is hence a Borel set. 
We shall complete the proof that # is Borel by showing that E = (n g[A,]. 
Clearly, ES (n g[A,]. Suppose, on the other hand, that x € g[A,] for all 
n and x ¢g[F]. Let y=f(x) and c=m(y). Then ce F and so g(c)#2; thus 
g~*({x})=T is a compact set €C, not containing c and lying entirely to 
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the right of c. Hence there exists a d>0 such that all points of T are at a 
distance >d away from c. If (1/n)<d, then TEC — 4, and as T =g }({z}), 
x¢€g[A,]. This contradiction shows that H=(), g[A,]. 

We now come to the general case. Write X=\), K,, where K, is 
compact, K€ A,¢---, and fis continuous on each K,. There exists, by 
the special case, proved just now, a Borel set D,< K,, such that f is one- 
one on D, and f[D,]=f[K,]. Now, the D,’s need not increase with n. We 
shall show, however, that they may be chosen to possess this property as 
well. We define Borel sets Z,,#.,--- by induction in the following manner: 


E, = D,, 
and for n>1, 


(3) E, = {D,—D, O (f-*(flBa-1)))} O Enea 


We claim that the H,,’s have all the desired properties. Clearly E CH,¢:-.-. 
We claim that for each n, f is one-one on £, and f[H,]=f[K,]. This is 
proved by induction on n. Since Æ, = D,, this is true for n=1. Suppose 
now that n>1 and that f is one-one on E,- with f[#, _,]=f[K,-_,]. Then 
the formula (3) implies easily that f is one-one on E, and maps E, onto 
JKn]. Let H=, En. Then EF is a Borel set, f is one-one on Æ, and 
J[E]= Y. Since E, S&K, it is obvious that E A f~1(f[K,])cK, for all n. 
The lemma is completely proved. 


Corollary 5.2. Let X and Y be standard Borel spaces and f a Borel map 
of X into Y. Let p be a finite measure on X and q the measure E —> p(f-*(E)) 
defined on Y. Then, the range f[X] of f 1s q-measurable and its complement 
has q-measure zero. Moreover, there exist Borel sets A and Z such that (i) 
ASX, Zcof[X], (ii) (Y — Z)=0, and (iii) A is a section for f over Z, i.e., 


fis one-one on A and maps A onto Z. 


Proof. The corollary is trivial if X is countable. Hence we consider the 
case when X is uncountable. We may assume that X and Y are both 
identical with the Borel space associated with the unit interval [0,1]. By 
a theorem of Lusin (Halmos [1], p. 243) we can find compact sets K,¢ X 
such that (a) KLEKS- --, (b) p(X —U, K,)=0, and (c) f is continuous 
on each K,. Let Z=\), f[K,]. Each f[K,] is compact and hence Z is a 
Borel set. Since U, K,Sf-4(Z), p(X —f-4(Z))=0 so that q(Y-—Z)=0. 
This already shows that f[X] is qg-measurable and q(Y —f[X])=0. We can 
now apply the Federer-Morse lemma to (J, K, and construct a Borel set 
ASX such that f is one-one on A and maps A onto Z. The proof of the 
corollary is complete. 


Remark. It is actually true (though it is harder to prove) that the 
range f[X] is a universally measurable set; i.e., for any finite measure « 
on Y, f[X] is an a-measurable set. Corollary 5.2 is valid even in this case, 
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with q replaced by a. We shall not prove these results but refer the 
reader to the paper of von Neumann [2] and Kuratowski [1], [2]. We refer 
to this stronger version of corollary 5.2 as von Neumann’s cross-section 
lemma. 


Suppose X and Y are Borel spaces. Let v be a measure on Y, and for 
each xe X let us be given a v-equivalence class of Borel functions on Y. 
The problem we now want to examine is whether we can select for each 
x a representative f(x,.) in the corresponding class such that the function 
x,y —> f(x,y) is Borel on Xx Y. We need two auxiliary lemmas. 


Lemma 5.3. Let X be a Borel space, S a separable metric space, and 
f a Borel map of X into S. Then there exists a sequence {f,} of Borel maps 
of X into S such that (i) each f, takes only countably many values, and 
(ii) f (x) —> f(x) untformly for xe X (as n > œ). 


Proof. Fix the integer n > 1. Since S is a separable metric space, we can 
cover S by a countable family {B m} of balls of radius 1/2n. Let the sets C m 
be defined by C, = B, and Cm=Bm—U;<m B; (m>1). Then the C,, are 
disjoint and S=\),,C,,,. Since C,,¢ B m, diameter of C „< l/n for all m. For 
each m such that C m is nonempty, let s „ be some point of C m. We define f, 
as the map 


fn(%) = Sm if xef (Cn). 


Then f,, is Borel, has countably many values at most, and dist(f (x), f(x)) < 
1/n for all xe X. 


Lemma 5.4. Let X be a Borel space and S a separable Banach space. 
Let K be a subset of S*, the dual of S, with the property that the linear com- 
binations of elements of K are dense in S*. If f is a map from X to S such that 
for each k e K, the function x — k(f(x)) is Borel on X, then f is a Borel map 
of X into S. 


Proof. Let Z* be the smallest Borel structure on S with the property 
that all the elements of S* are Borel functions with respect to it. From our 
assumption about K it follows easily that for each A e @*, f-4(A) is a 
Borel subset of X. To prove that f is Borel, it is enough to prove that @* 
coincides with the natural Borel structure Z of S. Obviously, @*CcZ. 
Since Z is generated by the open sets, it suffices to prove that any open 
set belongs to #*. Now, as S is separable, every open set is a union of 
countably many balls. Thus it is enough to prove that @* contains all 
balls. Further, as x —> cx (c € C) and x + x+a (a € 8) are easily seen to be 
automorphisms of the Borel space (S,4*), we are reduced to proving that 
the unit ball B={x : ||z|| <1} lies in @*. Now, it is well known that under 
the weak *-topology for S* (=the smallest topology for S* such that 
k — k(x) is a continuous function on S* for each x € 8), the unit ball B* of 
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S* is a compact metric space (Dunford-Schwartz [1]), hence separable. 
Let {x,*,2.*,---} be a countable dense subset of the unit ball of S* in the 
weak «-topology. Then (by the Hahn-Banach theorem), 


B = {x: |g*(x)| < 1 for all «* e B*} 
= N {x : |x,*(x)| < 1 for all n}. 


This shows that Be @* and completes the proof that #*=@. This 
proves that f is a Borel map. 

Now we are in a position to formulate and prove the lemma on the 
selection of representatives. 


Lemma 5.5. Let X and Y be Borel spaces, with Y separable. Let u and v 
be a-finite measures on X and Y, respectively. Let f be a complex valued 
function on XxY and let Y be a union of Borel sets Y, of finite v-measure 
such that (i) for each xe X, y —> f(x,y) is a Borel function on Y which is 
v-integrable on each Y, (ii) for any i and any Borel set FOY,, x—> 
Í p f(zy)dv(y) is a Borel function of x. Then, there exists a complex Borel 
function f* on XxY and a Borel set NSX of p-measure zero such that 
for each xe X—N, 


(4) f*(xy) = f(xy) 
for v-almost all y. 


Proof. We may clearly drop down on one of the Y, so that we may 
assume that v( Y) <oo. Also since u intervenes only in terms of its null sets, 
we may replace p by a finite measure mutually absolutely continuous 
with respect to u; hence we assume that u is also finite. 

Write S=2Z!(v), the Banach space of (equivalence classes of) v-inte- 
grable Borel functions on Y.S is a separable Banach space since the Borel 
structure of Y is countably generated. Since y —> f (x,y) is a v-integrable 
Borel function on Y, it defines an element, say f(x), of S. We claim that 
x- f(x) is a map of X into S which satisfies the conditions of lemma 5.4. 
Now the dual of #}(v) is Z ”(v) and every bounded Borel function on Y 
is a uniform limit of linear combinations of characteristic functions yp 
of Borel subsets F of Y. yp defines the linear functional tp : u —> 


f, Uy)dr(y) on Lv). By hypothesis, x —> t,(f(x)) is Borel for all F. We 


may therefore conclude from lemma 5.4 that x—>f(x) is a Borel map 
from X to S. So X is the disjoint union of countably many Borel sets on 
each of which f is bounded. We may assume therefore that f itself is 
bounded. By lemma 5.3 there exists a sequence {f,} of Borel maps of X 
into S such that (i) each f, is bounded and takes only countably many 
values, and (ii) f (x) > f(x) uniformly in xe X. Since f, is bounded and takes 
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only countably many values, it is obvious that there exists a ux v- 
integrable Borel function f,* on Xx Y such that for each xe xX, the func- 
tion y —> fa* (x,y) defines the element f,,(”) of S. We then have: 


sup Í (fat ley) -Sa* (Eyd) > 0 
xeX JY 


as n, m — œ and hence 


f [fat (0.4) -fnt Eyd x v)(ae,y) > 0 


xXxY 


as n, m —> ©. As L'(u xv) is complete, there is a Borel function f* on 
XxY such that f* is (u x v)-integrable and 


f foley) —f* (ay) |d(w x v) (ayy) —> 0 


xxyY 


as n—> oo. Select a subsequence {f,,*} such that f,,*(a,y) > f*(x,y) for 
(u x v)-almost all x, y. Then there exists a Borel set N of u-measure zero 
such that for each xe X—N, f,,,*(x,y) > f*(x,y) for v-almost all y e Y. 
It is clear that for each x e X —N, f*(z,y)=f (x,y) for v-almost all y. This 
proves the lemma. 


Corollary 5.6. Let X, Y, p, and v satisfy the same restrictions as in lemma 
5.5. Suppose that Q(x >q.) 1s a map from X into the space of all finite 
measures on Y such that (i) q,<«v for all xe X, and (ii) for each Borel set 
FS Y, x—q,(F) tsa Borel function on X. Then, there exists a Borel function 
f* on XxY and a Borel set NSX of -measure zero such that for each 
xe X—N, y—>f*(x,y) is a version of dq,/dv. 


Proof. For each «eX select a Borel function f, on Y such that 
fr2=dq,/dv, and define f on XxY by f(z,y)=f,(y). The corollary follows 
from the lemma at once. 


2. LOCALLY COMPACT GROUPS. HAAR MEASURE 


The groups which are commonly encountered in physical problems, 
such as the Lorentz group or the rotation group, are topological groups 
(cf. Pontrjagin [1] for the theory of topological groups). Moreover the 
underlying topological spaces are locally compact and satisfy the second 
axiom of countability. Because of this and other technical reasons, we 
shall restrict ourselves in the sequel to locally compact groups satisfying 
the second axiom of countability (lese). The Borel structures associated 
with the underlying topologies of these groups are standard. We shall 
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describe in this section a few known facts about lese groups and Haar 
measures. A detailed treatment can be found in Halmos [1] (pp. 266-289). 

Suppose that G' is a group which is at the same time a Borel space. We 
shall say that G is a Borel group if the map x, y > zy~! of G xG into G 
is Borel; G is called a separable (standard, etc.) Borel group if the under- 
lying Borel structure is separable (standard, etc.). In this book we shall 
deal only with separable Borel groups. If G is a separable Borel group, the 
maps x—> 271, x — xh, and x — hx (he G) are Borel automorphisms of 
the Borel space Œ. Any lesc group is a standard Borel group. 

Suppose G is any lesc group. It is a classical fact that there exists a 
nonzero a-finite measure u on G such that u is left invariant, i.e., u(#)= 
p(x) for all Borel sets E and all x e G. p is essentially unique in the sense 
that if u’ is another o-finite left invariant measure, then p’ =c-p for some 
constant c>0. u is called a left Haar measure on G. p(U)>0 for every 
open set U and p is a Borel measure. Similarly there exists a nonzero 
Borel measure p, which is right invariant, i.e., u,(£)=p,(Ex) for all Borel 
sets H and x e G. Again, the right invariance determines u, up to multi- 
plication by a positive constant. u, is called a right Haar measure on G. 

In general, it is not true that a right Haar measure is left invariant. 
When this is so we shall say that G is a unimodular group. Suppose G is 
any lcsc group and that u and u, are left and right Haar measures on G. 
For any zx e G, the measure E —> (Ex) is also a left Haar measure and 
hence there exists a constant A(x) >0 such that 


(5) (Ex) = A(x)u(E) 


for all E. Clearly A does not depend on which left Haar measure we use in 
(5). The function A(x —> A(x)) is known to be a continuous homomorphism 
of G into the multiplicative group of positive real numbers, i.e., 


(i) A(e) =1 (e the identity of G), 
(6) (ii) A(ay) = A(z)A(y) (zy EG), 
(iii) A is continuous. 


(i) and (ii) are easy consequences of (5); the third needs a little work 
(cf. Loomis [2], pp. 117-120). A is called the modular function on G. If f is a 
continuous function vanishing outside a compact set of G, we have, 
from (5), 


(7) [, feeds) = ala) | feedul). 


From this it follows that 


(8) f fe@a@-rdue) = f fleau) 
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The equation (8) shows that the measure u, defined by 


(9) p(B) = | Al~ )dyt) 


E 


is a right Haar measure. It follows from (9) that u, and p are mutually 
absolutely continuous; the Radon-Nikodym derivatives dp,/du and 
dujdu, are, respectively, the functions z —> A(z)~+ and z — A(z). A simple 
calculation using (9) shows that 


(10) (zE) = A(x)~*y,(Z) 


for all Borel sets E and all ze G. 

It is clear from (9) that G is unimodular if and only if A is the trivial 
homomorphism, i.e., A(z)=1 for all z e G. Since x — log A(x) (x e G) is a 
continuous homomorphism of G into the additive group of real numbers 
which does not have any nontrivial compact subgroups, it follows that A is 
trivial on any compact subgroup of G. Compact groups are therefore 
unimodular; in this case u and p, are finite. It is customary, when G is 
compact, to reserve the term Haar measure to the unique left and right 
invariant measure giving measure | to the whole of G. If G is not compact, 
p and p, are not finite. Abelian groups are trivially unimodular. Since A 
is a homomorphism, A(z)=1 whenever z is of the form zyx~+y~! for 
z,yeEG. Let G’ be the smallest closed subgroup of G containing all 
elements of the form zyx~1ty~1 (x, y eG); A is then identically 1 on G”. 
Hence a sufficient condition for G to be unimodular is that G=". 

There exist groups which are not unimodular. Suppose that G is the 
group of all matrices g, 


0 1l 
We identify g with the upper half-plane by the mapping g — (x,y). If 


p(B) = l f A dady, m (E) = J | - dzdy, 


then p and p, are, respectively, left and right Haar measures on G. 


Y x 
g = ) (x, y real, y > 0). 


3. G-SPACES 


Suppose that X is a Borel space and G is a separable Borel group. We 
shall say that X is a G-space if for each g e G, there exists a Borel auto- 
morphism t,(x —> g-x) of X such that 

(i) tis the identity, 


(11) ., 
(ii) Z = bo boa (91, J2 E€ G). 
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X is said to be a Borel G-space if the map g, x — g-x of Gx X into X is 
Borel. We shall say that X is a standard Borel G-space if X is a Borel 
G-space and if the Borel structure of X is standard. For any G-space X 
and any x € X, the set 


(12) G,={9:gEG, g-x=2x} 
is a subgroup of G. It is called the stability subgroup at x. The set 
(13) G.a={g-a:geEG} 


is called the orbit of x. If y=h-x (h e G), then G,=h-G,-h-' as is easily 
verified. Thus the stability subgroups of the points on the orbit of x are 
conjugate to G, and any subgroup conjugate to G, is the stability subgroup 
of some point on the orbit of x. 

If X is a Borel G-space and Y is a subset of X, we shall say that Y is 
G-invariant if G-xc Y for all xe Y; in this case, the Borel subspace 
defined by Y becomes in a natural and obvious fashion, a Borel G-space. 
It is called the Borel G-subspace defined by Y. If X, and X, are two Borel 
G-spaces, and f a Borel map of X, into X., then f is called a G-homo- 
morphism if for all x, € X,, and all geG, 


(14) f(g-%1) = 9-f(%1). 

A G-isomorphism is a G-homomorphism f with the additional property 
that f is a Borel isomorphism of X, onto XQ. If there exists a G-isomorph- 
ism of X, onto Xz, we shall call X, and X, isomorphic Borel G-spaces. 

Suppose that X is a locally compact Hausdorff space satisfying the 
second axiom of countability and, further, that G is a lese group. Let X 
be a °G-space with the additional property that the map g, x —>g:x of 
Gx X into X is continuous. Then X is obviously a Borel G-space. We shall 
refer to it as a locally compact G-space. G is said to act continuously on X. 
In this case, for each g € G, x —> g -x is a homeomorphism of X onto itself. 
Since G is a countable union of compact sets, G-x is also a countable 
union of compact sets for any xe X. Every orbit in a locally compact 
G-space is thus a Borel set. The stability subgroups are, moreover, closed 
subgroups of G. 

A measure p on a Borel G-space X is said to be invariant if p(g- E)=p(E) 
for all Borel sets E€ X and all g e G. In many problems it is necessary to 
work with measures which are only quasi invariant. A measure p on X is 
said to be quasi invariant if p(#)=0 if and only if p(g- #)=0 for g € G, i.e., 
if the action of G on X transforms p-null sets into p-null sets. Notice that 
if p is quasi invariant and f,, fẹ are two functions on X which are equal 
almost everywhere, their transforms f, and f? also have this property 
for any g e G, where f/9(x)=f,(g~1-x) (x e X, i=1,2). For any measure p 
on X and any g e G, let us define the measure p° by 


(15) pL) = p(g-*- #). 
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p is quasi invariant if and only if p° and p are mutually absolutely con- 
tinuous for all g e G. Therefore, if p is quasi invariant, so is any measure 
mutually absolutely continuous with respect to p. If X is a locally com- 
pact G-space and p is a Borel measure, then so is p°. 

A simple example of a G-space (G any lese group) is obtained when we 
take X=G and define, for xe G and géG, g-x=L,(x)=gx. Obviously 
G=G-e and the stability subgroups are all trivial. If » is a left Haar 
measure, u is invariant. If p, is a right Haar measure, p, is not in general 
invariant; but it is quasi invariant since it is mutually absolutely 
continuous with respect to u (cf. (9)). 

A Borel G-space X is said to be transitive or homogeneous if X =G. x for 
some x € X. In this case, X =G-y for each y e X. The above example is a 
simple example of a transitive G-space. If X is a standard Borel G-space 
which is transitive and if the stability groups associated with the points 
of X are all trivial, then we shall say that X is an affine space associated 
with G. If we assume G is standard and choose a point x e X, the map 
g —>g-x is a Borel isomorphism of G onto X which is a G-isomorphism of 
the G-space G (G acting on itself by left translations) on the G-space X. 
But this isomorphism depends on the choice of x. 

We shall conclude this section with a theorem due to Varadarajan [2] 
which imbeds a standard Borel G-space in a compact metric G-space, G 
being any lese group. 


Theorem 5.7. Let G be a lesc group and X be any standard Borel G-space. 
Then there exists a compact metric G-space Y, and an invariant Borel set 
ESY such that X is G-isomorphic to the Borel G-subspace of Y defined 
by E. 

Proof. Let p be any left Haar measure on G. Denote by #? the Banach 
space of (equivalence classes of) complex Borel functions f on G such that 


(16) Ifl = f Folde < 00, 


If heG and f*(z)=f(h-4z) for xeG, fe Lt and |/f"],= lf] It is 
well known that f,h— f” is a continuous map from ZtxG into 2 
(Loomis [2], p. 118). Let F be the complex vector space of all complex 
valued bounded Borel functions on X, and for ue F let 


(17) [ulo = sup ju(x)|. 


Since the map g, x—>g`t-x of @GxX into X is Borel, the function 
g, x —> u(g71-x) is Borel for each we F. If fe Z!, the function x —> 
f of (g)u(g-*+x)du(g) is Borel on X; we shall denote it by f * u. It is obvious 


that fxueF and |fxullo<|fil.-ulo..u—>f*uis a linear map of F 
into itself. 


MEASURE THEORY ON G-SPACES 161 


For any nonempty set YGF we write Y(QM) for the set of all complex 
valued maps e(f,u—c(f,u)) on x2 such that (i) for each we F, 
f — e(f,u) is linear on #1, and (ii) |e(f,~)| < If lla lu] o for all f, we £t x 2. 
We equip Y(2) with the smallest topology with respect to which all the 
maps c—>c(fu)(fe',ueD,ce Y(M)) are continuous. From the 
defining properties (i) and (ii) it is obvious that Y(@) is a compact Haus- 
dorff space. If 2 is countable, it is clear that Y(2) is even a compact 
metric space. We shall now convert Y(2) into a G-space. We define, for 
ce Y(2) and g e G, the element g-e by 
(18) g-c( fu) = (f9 ?,u), 
where, as usual, f"(r)=f(h-1-2) (x, heG@). Since |f =|] it is 
clear that g-ce Y(2), and a simple calculation shows that Y(2) is a 
G-space. 

We now claim that the map g, c > g-c of Gx Y(Q) into Y(Q) is con- 
tinuous. In view of the definition of the topology on Y(Q) this reduces to 
proving that for each (fiu)e 1x2, the map g,e—>ec(f%*,u) from 
GxY(Q) into the complex numbers is continuous. However, an easy 
calculation shows that the inequality 


lo(f7,u)—e'(f*,u)| < |e(f?,u)—e'(f%,u)| + | f?—f" lla lulo 
is valid for all c, c'e Y(Y) and g,heG, and the right side tends to 0 
when A —> g and ce’ —> c. G thus acts continuously on Y(Q). 

We shall now imbed X in Y(Q) for a suitably chosen 2. Let A,,Ag,--- 
be a sequence of subsets of X which generate the Borel structure of X, 
and let u, be the function which is 1 on A, and 0 on X—4A,. Let 2= 
{ui U,--+} and let Y= Y(2). Since 2 is countable, Y is a compact 
metric space. For any x e X let us define c, by 


(19) c,(f.u) = (f * u(x) (Je, ue). 


Obviously c, e Y. x > c, is thus a map, say €, of X into Y. Since the 
map x — (f » u)(x) is Borel for all fixed f, u, it follows that £ is a Borel 
map of X into Y. Further, if g e G, we have: 


Cy.x( fu) = (f* u)(g-x) 
= | fyu(a-*-9)-2\du(h) 


= [_slatyue-*- ayant 
= (f7 x u)(z) 


= (g-c)(f,w). 


This proves that ¢ is a Borel G-homomorphism from the G-space X into 
the G-space Y. Let E be the range of £. E is clearly invariant. 
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We assert that £ is one-one. Suppose in fact that x, y are two points of 
X such that (fx u)(z)=(f*u)(y) for all f, ue x2. Then, for each 
u e 2, u(g~1-x)=u(g~1-y) for almost all g e G. As 2 is countable, there 
exists a Borel null set NSG such that for any ge G—N, u(g-}-x)= 
u(g~+-y) for all u e 2. Let go be any point of G—N, and let x), =g,)~ 1-2, 
Yo=9o *-y. As the Borel space X is separable and as A,, Ag,-:- is a 
generating family, it is also a separating family (cf. Section 1). Hence, 
U(%o) = U( Yo) for all u e D implies that za =y. Therefore x= y. 

Therefore é is a one-one Borel map of X into Y. Since X and Y are 
standard, this implies that the range EH of ¢ is a Borel subset of Y and 
that é is a Borel isomorphism of the Borel space X onto the Borel sub- 
space of Y defined by E. Since we have already proved that Ẹ is an 
invariant subset of Y and that € is a G-homomorphism, the proof of 
theorem 5.7 is complete. 


Corollary 5.8. Let G be a Icsc group and let X be a standard Borel G-space. 
Then, for any x e X, the stability subgroup G, (of G) at x is closed. Moreover, 
the orbit G-x of any x in X is a Borel subset of X. 


In fact, it is enough, in view of theorem 5.7, to prove these facts when 
X is an invariant Borel subset of a compact metric space on which G 
acts continuously. In this case, both assertions are obvious. 

We shall end this section with two technical results which describe the 
manner in which the quasi-invariant measures on a Borel G-space trans- 
form under the action of G. For any o-finite measure p on X we define 


p? by (15). 


Lemma 5.9. Let X be any Borel G-space, G, a lcsc group. Let p be a 
o-finite quasi-invariant measure on X and let f,(g,.) be a version of the 
Radon-Nikodym derivative dp/dp®~ >. Then for fixed 91, 92 € G, 


(20) Sr(919 25%) = fo(91,92°%) fo(G2,%) 
for p-almost all x. 


Proof. We begin the proof with a formula which describes how a 
Radon-Nikodym derivative transforms under a Borel automorphism. 
Let u,v be o-finite measures on X such that p«v. Let t(x —>t-x) be a 
Borel automorphism of X. Define the measures pê and » by setting 
p'(#)=p(t-*(#)) and +(E)=v(t7}(E)), for all Borel sets E. Then p&r. 
If E is any Borel set, 


W(P) = f, (duldsy (aya 


(E) 


= I, (dp|dv)(t~1-2)dv'(x), 
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showing that 
(21) du'/dv' = (dp/dv)'. 


This said, we come to the actual proof. Fix g,,g.¢G. Then p929 = 
(p%)%2 and dp/dp 2%.) = (dp/dp%)-(dp/dp%)% from what we said above. 
This implies easily that 


fr(9192,%) = fp(91,92°%)fp(9o,%) 


for p-almost all x. 
Our next result tells us that we can choose these Radon-Nikodym 
derivatives so they are “almost” Borel on Gx X. 


Theorem 5.10. Let G be a Icsc group, p a left Haar measure on G, and let 
X be a separable Borel G-space. Let p be a o-finite quasi-invariant measure 
on X. Then, there exists a Borel function F, on Gx X which is positive 
everywhere such that for p-almost all g e G, x — F,(g,x) is a version of the 
Radon-Nikodym derivative dp/dp°~". In particular, the identity 


(22) Fagg) = £'9(91,92°%)f(G2,%) 
ts satisfied for u x u x p-almost all (gga £) EGxGx X. 


Proof. We start with the special case when yp is finite. We first claim 
that for each Borel set Ho X, g — p(g-E) is Borel. In fact, the map 
t:g,2x—>g,g-x is a Borel automorphism of GxX and hence A= 
t(G x E] is a Borel subset of Gx X. Let u be a left Haar measure on G. 
Then we know that for each g e G, the set A9={zx : (g,x) € A} is a Borel 
subset of X and g — p( A?) is a Borel function on G. Since A9=g-#, our 
claim is proved. 


We are now in a position to apply corollary 5.6. It follows from this 
corollary that there exists a Borel function F, on Gx X such that for 
almost all g e G, x —> F,(g,x) is a version of dp%~»/dp. Since p is quasi 
invariant, for each g, F',(g,”) is p-almost everywhere positive and hence, 
by the Fubini theorem, F, is positive for almost all points of Gx X. We 
may thus assume that F, is positive everywhere without altering the 
conclusions, Put F, ,=1/F,. Then F is a positive Borel function on 
Gx X and for -almost all g, x > F,(9,x) is a version of dp/dp~. Finally, 
the identities (22) follow from (20). This completes the proof of the 
theorem when vy is finite. 

If p were a-finite, we choose a finite measure q such that q«p and 
p<«q. Let a(x — a(x)) be a positive Borel function which is a version of the 
Radon-Nikodym derivative dp/dq. Choose F, as a Borel function on 
Gx X corresponding to q, and define 


(23) F (9,0) = a(x): F,(9,2)-a-*(g-). 
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The identity 

dp/dp™ = dp/dq-dq/dq™ - (dq/dp)™ 
implies that F, has all the required properties. 


4. TRANSITIVE G-SPACES 


We shall now examine the transitive G-spaces somewhat more closely. 
Let G be a lese group. Let Go be a closed subgroup of G. Let X = GG, the 
space of all left cosets gGo, g e G. Let B be the canonical mapping of G 
onto X defined by 


(24) B : 9 > gGo, 
and for any element g of G and =hG, of X let g-x be defined by 
(25) gx = ghG. 


Clearly, g-~ depends only on g and x and not on the element h used to 
define x. G thus acts on X and this action is transitive. Let X be equipped 
with the quotient topology, that is, Uc X is open if and only if B~1(U) 
is open in G. A map c(x — c(x)) of X into G is called a Borel (continuous) 
section if it is a Borel (continuous) map and if B(c(x)) =x for all x e X; then 
its range meets each left G coset exactly once. It is said to be regular if 
for each compact subset K of G, cLX] A B~1+(B[K]) is a set with compact 
closure. 


Theorem 5.11. X, under the quotient topology, is a locally compact 
Hausdorff space satisfying the second axiom of countability and B is an open 
continuous map. Moreover, X is a G-space and the map g,x-—>g-x of 
Gx X into X is continuous. If K is any compact subset of X, there exists a 
compact subset K, of G such that B[K,|)=K. Further, there exists a regular 
Borel section for G/Gy. A map f of X into some Borel space Z ts Borel if 
and only if the map f°=f>B of G to Z ts Borel. In particular, if ESX, 
E is a Borel set if and only if the set B-1(E) is a Borel subset of G. Finally, 
let Y be a transitive standard Borel G-space, y e Y and let Gy be the stability 
subgroup at y. Then the map t:B(g)—>g-y is a Borel isomorphism of 
X=G/G> onto Y which is a G-tsomorphism of the associated G-spaces. If 
Y is a locally compact G'-space satisfying the second axiom of countability, 
t is a homeomorphism. 


Proof. That X is locally compact Hausdorff second countable and that 
B is open are well known (cf. Pontrjagin [1]). 


Now, G is clearly transitive on X. The map (g.g) — (g,B(g’)) of Gx G 
onto Gx X is continuous and it is easily shown that the topology for 
Gx X is the quotient topology relative to this map. The map g, x —>g-x 
is therefore continuous. 
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Let W,, Wa,- -- be a sequence of compact sets in G with nonempty 
interiors such that (J, (interior(W,,))=G. If then K is a compact set S X, 
then for some n, 


Ks Ù BW, 
j=1 


and hence K=f[K,], where K, =8-1K) A (U?-, W,). Clearly K, is 
compact. 

We shall next prove the existence of a regular Borel section for G/@p. 
Let {K,,} be a sequence of compact subsets of G such that (a) KLEKS- 
and U,K,=G, and (b) any compact subset of G is contained in some 
K,. Using lemma 5.1 we can construct a Borel set Æ such that (i) 
EO B-1(8[K,])¢&,, for all n, and (ii) B is one-one on E and maps E onto 
X. ß is thus a Borel isomorphism of E onto X as # and X are both 
standard. If c denotes the inverse map, c is a Borel isomorphism of X onto 
E. cis a Borel section. It is obvious from (i) and (b) above that c is regular. 

Suppose now the set A€ X is such that B-1(A) is a Borel set in G. Then 
E A^ B-1(A) is also Borel. Since 8 is one-one on Æ, and since the Borel 
subspace defined by Æ is standard, A =f[# N B-+(A)] is a Borel subset of 
X. If f is a Borel map of X into Z, where Z is some Borel space, then 
f°=f o Bis a Borel map of G into Z. Conversely, let f° be a Borel map of 
G into Z such that f° =f  B for some map f of X into Z. If F is any Borel 
subset of Z, B~1(f-1(F))=f°-1(F) is Borel in G and hence f~+(F) is 
Borel in X. f is thus a Borel map. 

We finally come to the uniqueness of transitive G-spaces. Let Y, y be 
as in the statement of the last assertion of theorem 5.11. Clearly the map 


t : Pg) > gy 

is one-one, maps X onto Y, and is moreover such that t(g-x)=g-#(x) for 
all xe X and geG. If McY is open, t-+(M)=§f[A], where A= 
{g :g-y e M} so that ¢~1(M) is open. t is thus continuous. A classical 
category argument (Pontrjagin [1]) now implies that ¢ is a homeomorphism. 

Suppose we assume in the above proof that Y is only a standard Borel 
G-space. We then use corollary 5.8 to conclude that G, is a closed sub- 
group of G. Thus ¢ is well defined and the same argument as that given 
above now shows that ¢ is Borel. Since both X and Y are standard, t is 
a Borel isomorphism. This completes the proof of the theorem. 


Since G acts on X, we may speak of invariant measure classes on X. 
We shall now proceed to study these. We use a “lifting” technique to 
reduce the problem to G itself. Now G acts on itself by left and right 
translations. We have accordingly left and right invariant measure 
classes. We write p for a left Haar measure and define u, by 


(26) h(E) = p(E~*). 
u is a right Haar measure. 
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Lemma 5.12. There exists a unique measure class on G which is left (or 
right) invariant, and it is the measure class determined by p. In particular, 
it is both left and right invariant. 


Proof. Let « be a finite right quasi-invariant measure. We want to 
prove that «(#)=0 if and only if »(#)=0. Let f be a Borel function on G 
such that 

(i) O< f(z) <1 for all xe G, 


Gi) | Sedu | flardun(e) < o. 


Such f are easily constructed. Suppose now for a Borel set A, a(4)=0. 
Since « is right quasi invariant, e(4x)=0 for all x in G and hence 


[_ o(4e-Dyterdu(e) = 0, 
i.e., writing y, for the characteristic function of A, 
f (f xaozdda(wn) erate) = 0. 
By Fubini’s theorem, this means 
f (f xawertterdute)da(y) = 0 
and hence, as the inner integral is >0, 
Í  xalverflerdute) = 0 
for a-almost all y. Thus, for some yo € G, 
f xalvor)f(edute) = 0 


and hence, as f(x) >0 for all x, 


Xal Yor) = 0 


for p-almost all v, i.e., u(Yọo714)=0 so that u(4)=0. Conversely, if 
p(A)=0, p(y~1A)=0 for each y and hence 


f xalweyfterdul) = 0 
for each y, proving that 


l, (l xalya)f (odua) day = Q. 


Using Fubini’s theorem, we get: 


f o4e-ferdu(e) = 0 
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and hence, as the integrand is >0, 
al Axo” 1) = 0 


for some zx, showing, as « is right quasi invariant, that a(A)=0. Thus 
Uur œ, u are all in the same measure class. If «œ is left quasi invariant, the 
measure & defined by &(A)=a(A~?) is right quasi invariant so that 
u, œ, u, are in the same measure class. This proves the lemma. 


Remark. Lemma 5.12 enables us to speak without ambiguity of null 
sets in G; a Borel set A€ G will be called null if it is a set of measure zero 
for the unique measure class singled out by lemma 5.12. 


We now return to the general transitive case. Let Go be a closed sub- 
group of G and X=G/G,). We now observe that, on X, quasi-invariant 
measures can be constructed easily. Let a» be a finite measure on G which 
is quasi invariant, and let a be the measure on X defined by «(A)= 
«(8 +(4)). Then o(#)=0 if and only if 8~?(#) is a null set in G. «æ is thus 
quasi invariant. Our aim is to show that all quasi-invariant measures can 
be obtained in this manner, up to absolute continuity. 

Let C,(G) denote the class of all complex functions on G which are 
continuous and have compact support. Let C,(X) denote the analogously 
defined space of functions on X. For any f in C,(G) let us write Mf for the 
function on G defined by 


(27) (Mf)(g) = f f(gh)duo(h); 


here yy is a left Haar measure on the lese group Gy chosen in some way. 
It is then clear from the left invariance of uo that, for any hy in Go, 


(28) (Mf)(gho) = (Mf)(9) 
for all g e G. Hence there exists a function f~ on X such that 
(29) f° (BQ) = (MFg) — (g € G). 


We shall use the mapping f —> f~ to lift measures from X to G. 


Lemma 5.13. Let P be the set of all real nonnegative elements of C,(X) 
and P’ a subset of P having the following properties: (i) if fi, fa € P' and 
Cy, Cy are constants >0, ¢, fi, +cof,e P', (i) if fe P' and fre P, fife P, 
and (iii) for any x € X, there is an f e P’ with f(x)>0. Then P' =P. 


Proof. Let xe X. Then there exists an f,c P’ such that f,(x)=1. 
Therefore, for some open set U, with compact closure and containing zx, 
f(y) =4 for all y e U,. Let fı be an element of P such that f,(y)=f,(y)~? 
for all ye U,. Then f,’=f,f, € P’ and has the property that f,’(y)=1 
for all y e U. Suppose now that K is any compact subset of X. For each 
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xe K, let U, be an open set containing x and f,’ an element of P’ such 
that f,’=1 on U,. Choose points x,,---, x, € K such that KCU#_, Uz, 
Then f'= f}, +: + fx, belongs to P’ and f’(y)>1 for all y e K. Choose 
an fae P such that f.(y)=f'(y)~! for all ye K. Then f"=f,f’ belongs 
to P’ and f"(y)=1 for all y e K. Finally, let fe P and let K be a compact 
set such that f(y)=0 for y ¢ K. Choose an f” e P’ such that f"(y)=1 for 
all y e K. Then f=ff" e P’. This proves the lemma. 


We are now ready to formulate the main properties of the map 
fof. 

Lemma 5.14, For any feC (G), f~ €C(X). The map f—>f~ has the 
following properties: (a) f=0 implies f~ >0; (b) f—>f~ maps C,(G) onto 
C.(X) and maps the set of real nonnegative elements of C,(G) onto the set of 
real nonnegative elements of C.X); (c) for any continuous function u on X, 
(uf) =uf~ (where u’ =u o B) (d) if a € G, (f2)~ =(f~)*. 

Proof. We first prove that for any f in C,(G), f~ €C,(X). Clearly if f 
vanishes outside a compact set KEG, f~ vanishes outside [K]. Therefore 
there remains only the continuity of f~. For this it suffices to prove that 
Mf is continuous on G since then the continuity of f~ would follow from 
the fact that X has the quotient topology. Let gọ € Gand W, be a compact 
neighborhood of e. Now 


MPNO) (MPI) < | LFloh)—F(goh) duo) 


Since feC(G), given e>0, there exists a compact neighborhood 
WEW, around e such that |f(a)—f(a')| <e whenever aa’-1e6 W. If f 
vanishes outside the compact K, and gg,~! € W, 


|(MP)(9) —(Mf)(go)| < If (gh) —f (Goh) |du-o(h) 


(go 1W-1K)NGo 


Se duo(h), 
K'NGo 
where K’=(g,-1W,7~!K) A Go. This proves that Mf is continuous at go, 
finishing the proof that f~ e C.(X). 
The map f—>//~ is linear and it is obvious that f>0 + f~ >0. Next, 
let u be a continuous function on X and u°=wuo B. Then 


M(wf)(g) = ulg) f f(gh)duo(h). 


This shows that M(u°f)=u°Mf. Hence (w°f)~=uf~. Third, let ae G. 
Then (Mf*)(g)=(Mf)(a~*g) from which we conclude that (f*)~ =(f~)*. 
This proves that (a), (c), and (d) are satisfied. It only remains to prove (b). 
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Let 
P' = {f~ :fec(G@), f real and >0}, 


and let P be the set of all real nonnegative elements of C,(X). We claim 
that P’ satisfies the conditions of the lemma 5.13. (i) is obvious and (ii) 
follows from (c). To check that (iii) is satisfied at each point of X, it is 
enough to do so at xọ=ß(e) because G acts transitively on X and P’ 
satisfies (d). But then, if feC,(@)>0 and is 1 in a neighborhood of e, 
(Mf )(e) =f ~ (zo) > 0 obviously. This shows now that P’= P. In particular, 
C'(G) is mapped onto C,(X). The lemma is completely proved. 


For any Borel measure « on X, the map f— h f~da is, by virtue of 


lemma 5.14, a nonnegative linear functional on C,(@). Let us denote by 

a? the Borel measure defined on G by this linear functional. We thus have 

a mapping « —> «° from the set of Borel measures on X into the set of 

Borel measures on G. Since f —> f~ has the whole of C,(X) for its range, it 

is obvious that «,°=a,° implies a, =&z. a —> a? is therefore a one-one map. 
Let us now consider the equation 


(30) f e= | foda (fe CG) 


Our next aim is to extend (30) to certain Borel functions on G. 


Lemma 5.15. Let f be a Borel function on G such that 


(31) Í |f (gh) |dug(h) < 00 


0 


for each g e G. Then there exists a unique Borel function f~ on X such that 
for allg EG, 


f>(B(g)) = [ fighjduolh). 


If f 20, then f~ =0 and we have 


(32) f fdo® = f f~da 


in the sense that either both sides are infinite or both sides are finite and equal. 
If f is bounded and vanishes outside a compact subset of G, then f ~ is bounded 
and vanishes outside a compact subset of X. 


Proof. Since g, h —>f(gh) is Borel on GxG, it is clear that g— 
Í Go f(gh)duo(h) is Borel on G. Moreover, this function of g is constant on the 
left G,-cosets. Hence, by theorem 5.11, the existence and uniqueness of 
f~ are immediate. Also, f>0 implies f~ 20. Suppose f(g) =0 for all g ¢ K, 
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K being a compact set, and let |f(g)| <A for all g e G. Clearly, f~(x)=0 
for x ¢ B[K]. Further, if x e [K] and we write x= f(g) (g e K), 


f(a] = f Fahd) 


<A] duol) 
9~3KNGo 
< Ap(K-1K 0G). 


This proves that f~ is bounded. 

We now come to the proof of (32). We consider first the case when 
f=xx, the characteristic function of the compact set K&G. We can 
choose a sequence {fa} of nonnegative elements in C,(G) such that fn | yx 
pointwise on G. Then fa~ | f~ pointwise on X and 


f fda? = lim | fada? 
G n= o 


G 


lim | f,~do 


n — © x 


f f~da. 


This proves (32) in this case. Next, for any Borel set ACK, f=y4 is 
bounded and has compact support and hence y,~ is bounded and has 


compact support. It is therefore «-integrable and the map A —> fa xa ~da 
is easily verified to be a measure on the Borel sets of K. By what we have 
seen earlier, this measure coincides with «°? on all compact A S K. Therefore 
it must coincide with «° for all Borel sets A. But then this proves (32) for 
f=xa (ASK). At this stage we know that (32) is true for all simple 
functions f vanishing outside compact sets. 

For the general case, let f>0 and let {f,,} be a sequence of nonnegative 
simple functions with compact supports such that f, + f pointwise on G. 
Then f,~ t f~ pointwise on X. Hence by the Fatou-Lebesgue lemma, 


f fdo? = lim [ fda 
G 


il 
= 

5 
mao 
2 
Qu 

R 


Il 
=~ 
? 
Q 

R 


This completes the proof of the lemma. 


Corollary 5.16. For a Borel se ASX, a(A)=0, if and only if 
a°(B~*(A))=0. 
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Proof. Suppose that A is a Borel set < X and a(A)=0. If K is a compact 
set <8~1(A), then, yg~ =0 outside A and so, by (32), 


a°(K) = Í XK ~ da 
= (). 


This proves, by regularity of «°, that «(8-1(A))=0. Conversely, let 
«°(B-1(A))=0. Let KC A be compact. Since 8~+(K) is closed, there exists 
an increasing sequence {K,} of compact subsets of p71(K) whose union is 
B-1(K). Then «°(K,)=0 for all n, so that we can infer from (32) and the 
nonnegativity of xg,” that yx, ~=0 almost everywhere for each n. 
Suppose now that a(K)>0. Then, there will exist an x, e K such that 
Xx, (%1)=90 for all n. Choose a g, €8~*(K) such that B(g,)=2,. Then 
uol(gi Kan) A Go) =0 for all n. This is absurd as (g,~1K,) O Go t Go. Thus 
a(K) must vanish. By regularity of a, «(A)=0. 


Corollary 5.17. If a, and a, are Borel measures on X, aKa, if and only 
of æ? Ka’. In this case, 


(33) dæ? |da? = (dæs/|daæ,)?. 
Proof. Let a,«a, and let t=da,/da,. We may assume t20. For any 


nonnegative u € C.(@), the function ut? satisfies the conditions of lemma 
8.15 and, moreover, 


(ut?)~ = ut. 


Since u~t e S (a), ut? e H4(a,°) and we have: 
Í ut’daæ? = Í u~ tda 

G X 
‘ =Í u~ dags 

X 


= f udaæg?. 
G 


This shows at once that «,?&«a? and dag°/da,°=t°. On the other hand, 
if «,°<«a,°, we obtain the relation a,.«<a, at once from the previous 
corollary. 


Corollary 5.18. If G is compact, then for any Borel set ASX, a(A)= 
a®(B~*(A)). 

Proof. In this case, on normalizing u and py as usual, we see that for 
any weEC,(X), (u?)~ =u. Hence, f x uda = f o uda’. This implies the 
corollary. 
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From lemmas 5.14 and 5.15 the following theorem follows immediately. 
It is a basic theorem in the theory of quasi-invariant measures on X. 


Theorem 5.19. There are quasi-invariant Borel measures on X=G/Go, 
where G is a lesc group and Gy a closed subgroup. If À is a finite quasi- 
invariant measure on G, and A~ is the finite measure defined on X by the 
equation X~(A)=AXA(B-+(A)), then A~ is quasi invariant. Any two quasi- 
invariant oa-finite measures on X are mutually absolutely continuous. If 
ESX is any Borel set, the quasi-invariant o-finite measures on X vanish for 
E if and only if B-4(E) is a null set of G. If «1s any Borel measure on X, «a 
is quasi invariant (invariant) if and only if «a? is a quasi-invariant (left 
invariant) measure on G. 


Proof. If g eG and if « is a Borel measure on X, (af)? = (a°)9 as is easily 
seen from (d) of lemma 5.14. Moreover, any o-finite measure is mutually 
absolutely continuous with respect to some finite measure. Lemmas 5.14 
and 5.15 and corollaries 5.16 and 5.17 then lead quickly to the proofs of 
all the assertions of this theorem. 


Theorem 5.19 enables one to speak of null sets of X and G without 
specifying any quasi-invariant measure. If A is any quasi-invariant 
measure on X, the class of A-measurable sets is independent of À. In other 
words, we may also speak of measurable sets and measurable functions on 
X without ambiguity. 


Corollary 5.20. Let ASX. Then A is measurable if and only if B~3(A) 
is measurable. If f is a map of X into some Borel space Z, f is measurable 
if and only if f? =f o B is a measurable map of G into Z. 


Proof. It is enough to prove the first part since the second part follows 
immediately from the first. Suppose now that AS X is measurable. Then 
there are Borel sets A, and A, such that 4,C ACA, and A,—A, isa 
null set. Clearly B-~+(A,)GB~\(A)GB~*(Az) and B-4(A,)—B7-1(A 
B-1(A,—A,) is also null. 8~1(A) is thus measurable. 

Conversely, let B=8~1(A) be measurable. Let A be a finite quasi- 
invariant measure on G@ and let à~ be the induced measure on X. There 
exist sequences {K,} and {K,’} of compact subsets of 4 such that (i) 


we and K,’<SG—B for all n, and (ii) A(B—, K,)=A((G— B)- 
Un K,’)=0. Since £ is continuous, the sets A, and Ao, whore 
A, = U B[K,.] 


X-A, = UPIK,’ 


are both Borel. It is obvious that A,C ACA, and A~(A,—A,)= 
A is thus measurable. 
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Corollary 5.21. G acts ergodically on X, i.e., if f is a Borel map of X 
into some Borel space Z such that f(g-x)=f(x) for almost all (g,x) Ee G xX, 
then f is a constant almost everywhere. In particular, there exists, up to a 
constant multiplying factor, at most one invariant o-finite measure on X. 


Proof. By the Fubini theorem, there exists a point x, of X such that 
f (9-21) =f(x,) for almost all g. We may clearly assume that x, =x) =((e). 
Select a Borel set ACG such that A is the union of countably many 
compact sets, G—A is of Haar measure zero, and f(g-%)=f(% ) for 
ge AÁ. Let X,>=A-xX. Then X, is a Borel set and B-1(X —X,.)GG—A is 
null, showing that X — X, is null. Let a, and a, be nonzero invariant ø- 
finite measures. Then they are quasi invariant. Let f=da,/da,. Then, for 
each geG, f(g:x)=f(x) for almost all x. f is thus a constant almost 
everywhere, proving that a, is a constant multiple of «z. 


Corollary 5.22. Let S be a standard Borel space and let f be a Borel map 
of G into S. Suppose that for each he G, f(gh)=f(g) for almost all g. Then 
there exists a Borel map a of X into S such that f(g)=a(B(g)) for almost 
all g. 


Proof. We may clearly assume that S is the unit interval. Let à be a 
quasi-invariant measure on G with A(G@)=1. Let A, be a quasi-invariant 
measure on Gy with Ao(Go)=1. Let A~ be the measure on X induced by A 
through the map 8. By the Fubini theorem, f(gh)=f(g) for almost all 
(g,h) E Gx Go. Hence, for some Borel set N of G of measure zero, we can 
assert that for each g e G— N, f (gh) =f (g) for almost all h € Go. If we write 


fil) = i} f(gh)dAo(h), 


then f, is Borel, and for ge G—N, f,(g)=f(g). Moreover, for any fixed 
geG—N and k E Gao the functions h > f(gh’h) and h > f(gh) are both 
equal to f(g) for almost all h. Consequently, f,(gh')=f,(g) for all 
(g,h')e(G—N)xG,). Consider now the set X’=f[G—N]. X’ is a à~ 
measurable subset of X by corollary 5.2, and hence B-1(X’) is a A-measur- 
able set which is a union of left G-cosets, and, for all (g,h) e B~*(X') x Go, 
fi(gh)=f1(g). Moreover, X—X’ is null and so G—B~1(X’) is also null. 
Define now the function f, on G by 


fag) = ra (g e B~*(X’)), 
° 0 (g ¢ BX’). 


fo is A-measurable and f,(g)=f(g) almost everywhere. Further, fa is 
constant on the left G -cosets, and therefore there exists, by corollary 
5.20, a A~-measurable function f on X such that f,=/,°. Choose a Borel 
function a and X such that a(x) = f(x) for almost all x. Then f(g) =a(B(g)) 
for almost all g. This completes the proof. 


(34) 
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5. COCYCLES AND COHOMOLOGY 


The functional equations (22) are extremely significant in the theory 
of induced representations associated with G'-spaces. We shall devote the 
present section to a study of these equations in an appropriately general 
context. 

Let X be a standard Borel G-space, G being a fixed lese group. We choose 
an invariant measure class @ on X. Let M be a standard Borel group, 
fixed throughout this section. We write 1 for the identity of M. Let f be 
a function on Gx X with values in M. We shall say that f is a (G,X,M)- 
cocycle relative to @ if the following properties are satisfied: 


(i) fis a Borel map of Gx X into M, 
(35) (ii) f(e) = 1 foralmostall xex, 


(ii) f(9:92,%) = f(91:92°%)f(G2,x) for almost all 
(91:92,%) EAxGxX. 


If there is no ambiguity concerning the group M, we shall refer to f 
simply as a (G,X)-cocycle relative to € or, even more briefly, as a cocycle 
relative to €. Note that the null set in (ii) is with respect to @ while the 
null sets in (iii) are those of the measure class which is a product of the 
usual measure classes in G with @. 

A cocycle f is said to be a strict cocycle if it satisfies (35)(i) and further, 
(36) (ii’) and (iii’), where 


(i) f(e~7) =1 forall xex, 


36 
(36) (iii) fga) = f(91,92°%)f(G2,%) forall (g1gox)EGXGXX. 


Our decision to call the functions satisfying (35) and (36) cocycles is 
prompted by the fact that these equations are generalizations of the 
identities which describe the cocycles in the cohomology theory of groups 
(cf. Eilenberg [1], pp. 3-37). We note here that it was G. W. Mackey who 
first studied the cocycles in the context of an arbitrary transitive Borel 
G-space. It was the detailed study and analysis of cocycles which enabled 
Mackey to formulate and prove the correct generalizations of the classical 
work of Frobenius on induced representations (of finite groups), to the 
case when the group in question was an arbitrary lesc group. The aim of 
this section is to carry out a cohomology theory of (G,X,M)-cocycles. 

We begin by introducing equivalence relations in the set of all cocycles 
relative to a fixed invariant measure class @. Suppose that fı and f, are 
two cocycles relative to @. We shall say that f, and f are cohomologous, 
fi~fe in symbols, if there exists a Borel map b(x — b(x)) of X into M 
such that 


(37) Jalg) = blg: x) fı(g;æ)b(x) 
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for almost all (g,x) e Gx X. By taking b to be identically 1, we see that if 
fı and fa are equal almost everywhere on G x X, then f,~f.. It is easy to 
verify that ~ is a genuine equivalence relation in the set of all cocycles 
relative to @. Each ~ equivalence class will be called a cohomology class 
relative to @. Suppose further that fı and fo are strict cocycles. We shall 
then say that f, and f, are strictly cohomologous, fı X fz in symbols, if there 
exists a Borel map b(x —> b(x)) of X into M such that 


folg,x) = b(g-2x)f,(g,2)b(x) ~* 


for all (g,x) € Gx X. x is also easily verified to be an equivalence relation 
in the set of all strict cocycles. Each x -equivalence class will be called a 
strict cohomology class. 

The map 1 : g,2 —> 1 is a strict cocycle. Any cocycle cohomologous to it 
will be called a coboundary. If the cocycle is strictly cohomologous to 1, 
it will be called a strict coboundary. Obviously, a cocycle f is a coboundary 
(strict resp.) if and only if for some Borel function b on X with values in M, 
f(g;£) =6(g-x)b(x)~* for almost all (all resp.) (g,7) Ee Gx X. 

Suppose that M is abelian. Then the set of Borel maps of Gx X into M 
is an abelian group under pointwise multiplication; both the set of cocycles 
and the set of strict cocycles are subgroups of this group. The coboundaries 
form a subgroup of the group of cocycles and for two cocycles fi, fo, fi~ fo 
if and only if f,f.~1 is a coboundary. Therefore the cohomology classes 
are in one-one correspondence with the elements of the quotient group of 
the group of cocycles mod the group of coboundaries. This quotient 
group is known as the (G,X,M)-cohomology group. 

From now on we shall assume that G is transitive on X. We choose a point 
zo E X arbitrarily and write G, for the stability subgroup at x. Gp is 
closed. 8 denotes the map g —> g-x, of G onto X. B is a Borel map. For any 
map u of X into some space Z, u? denotes the map g —> u(A(g)) of G into Z. 
On the other hand, if v is a map of G into Z which is constant on the left 
G'o-cosets, there exists a unique map u of X into Z such that uw°=v; we 
denote u by v~. Thus (v~)? =v and (u°)~ =u. 

We shall also extend this notation to two variables. Let Y, Z be Borel 
spaces and let u be a map of Y x X into Z. We then write u° for the map 


y, g —> u(y,B(9)) 


of Y xG into Z. We note that u is Borel if and only if u° is Borel. 

At various points of our arguments in this section, we shall be con- 
cerned with functions on X, G, Gx G, Gx X, Gx Gx X, and so on. Since 
G and X possess unique invariant measure classes, there are canonically 
determined measure classes on each of the other spaces. For instance, if A 
and à’ are quasi-invariant o-finite measures on G and X, respectively, the 
measure Àx A’ determines a measure class on G x X which depends only 
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on the measure classes of A and A’. By a (Borel) null subset of Gx X we 
mean a set on which all the measures of this class vanish. Similar remarks 
apply to GxG@xX, etc. In view of the uniqueness of the invariant 
measure classes, we are entitled to omit any reference to them and speak 
of cocycles and coboundaries. This we shall do. 

Suppose now that f is a strict cocycle. Then it follows easily from (ii’) 
and (iii’) of (36) that A —> f (h,x) is a Borel homomorphism of the stability 
group G, at x into M. We shall call it the homomorphism defined by f at the 
point xe X. If m, and m, are two Borel homomorphisms of G into M, 
we shall say that m, and m, are equivalent if there exists a k e M such that, 
for all h € Go, 


(38) mlh) = km,(h)k7}. 


The cocycles and their cohomology classes are difficult to handle 
because of the null sets involved. The strict cocycles and their classes are 
much better behaved in this respect since there are no null sets. Our 
main concern is to show that in every cohomology class of cocycles there 
is a strict cocycle and that the latter is determined uniquely up to strict 
cohomology. This means that the cohomology classes of cocycles are in 
canonical one-one correspondence with the strict cohomology classes of 
strict cocycles, which are easier to analyze. 

The simplest strict cocycles are the strict (G,G@)-cocycles. Our method 
of examining strict (G,X)-cocycles consists in “lifting”? these to strict 
(G,G)-cocycles and analyzing the latter. 


Lemma 5.23. Let f be a strict (G,X)-cocycle. Then the function f° on Gx G 
defined by 


f°(9,9') = f(9.B(9’)) 


is a strict (G,G)-cocycle. If F is any strict (G,G)-cocycle, there exists a unique 
Borel map b of G into M such that b(e)=1 and 


(39) F(g,g') = 6(gg')b(g’)~ * 
for all (gg) EGxG. 


Proof. The first assertion is trivial. For the second, we note that (36) 
gives 
F(g,9') = F(gg'.e)F(g',e)~* 


for all (g,g')e Gx G. Equation (39) follows by putting b(g’)= F(g’,e). If 
b, is another Borel map of G into M such that b,(e)=1 and F(g,g')= 
b,(9g’)bi (9) ~t, then b(gg’) ~ *b1(99") = b(g’) ~ *b1(9") for all (9,9) e G x G. This 
shows that for some k e M, b(g)~1b,(g)=& for all g. Since b,(e)=b(e) =1, 
k=1 and b=0),. 
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In view of this lemma, with any strict (G,X) cocycle f, we can associate 
a Borel function on G. But f° is not the most general (G,G)-cocycle since 
it arises from a (G,X)-cocycle. This Borel function must therefore satisfy 
some special identities. The next lemma describes these identities. 


Lemma 5.24. Let m be a Borel homomorphism of Ga into M. Then there 
exists a Borel map b of G into M such that b(e)=1 and 
(40) b(gh) = b(g)m(h) 


for all (g,h) E Gx Go. Corresponding to any such map b, there is a unique 
strict (G',X)-cocycle f such that 


(41) f°(9,9') = B(gg')b(g')-* 

for all (g,9')EGxG; f defines m at xo. Conversely, if f is a strict (G,X )- 
cocycle and b is the Borel map of G into M such that b(e)=1 and b satisfies 
(41), then the restriction of b to Gy coincides with the homomorphism m defined 
by f at x, and b satisfies (40). 


Proof. Let m(h — m(h)) be a Borel homomorphism of G into M. Let 
c(x —> c(x)) be a Borel section for G/G ). We may assume c so chosen that 
c(xọ)=e. For any g e G, let a(g)=c(B(g))~*g. Since g and c(B(g)) lie on the 
same left coset gGy, a(g) € Go. We put 


(42) b(g) = m(a(g)). 
Since c and ß are Borel maps, g —> a(g) is a Borel map of G into Gy and 
hence g — b(g) is a Borel map of G into M. Also b(e)=m/(a(e))=1. More- 
over, for g € Gand he Go, a(gh) =a(g)h. This shows that 
b(gh) = 5(g)m(h) 
for all (g,h) e Gx Gy. Let 
F(g,9') = b(gg')b(g")~*. 
F is obviously a strict (G,G)-cocycle and (40) shows that F'(g,g'h)= 
F(g,g') for all (g,g',h) e Gx G x Gy. Hence, there exists a unique Borel map 
f of Gx X into M such that 
f(9.B(9')) = Figg’) 

for all (g, g') e Gx G. It is easily checked that f is a strict (G,X)-cocycle, 
that f°=F, and that it is the only (@,X)-cocycle with this property. 
Finally, for h € Go, f(h,x9) = F(h,e) =m(h). 

For the results in the converse direction, let f be a strict (G,X)-cocycle 
and let b be such that b(e)=1 and 


(43) f°(g.9') = B(gg')b(g’)-* 
for all (g,g') e Gx G. Then, for all (g,g',h) eG x Gx Go, 
b(gg'h)blg'h) => = b(gg’)b(g’)~*; 
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from which it follows easily that for each he Go, g’ —> b(g’)~ 1b(g’h) is a 
constant, say m(h). Thus b(g'h)=b(g')m(h) for all (g,h) e Gx Go. Since 
b(e)=1, b(h)=m(h), while (43) implies that b(h)=f(h,x9), for all k E€ Go. 
This proves everything. 


Lemma 5.25. Let f, be a strict (G,X)-cocycle and let m; be the homomorph- 
ism defined by f; at £o (t=1, 2). Then the following statements are equivalent: 


(a) m and m, are equivalent, 
(44) (b) fixa 
(c) fi~fe- 
Proof. (b) = (c) is trivial. We shall prove that (a) => (b) and (c) = (a). 
(a) = (b). Suppose that m, and m, are equivalent homomorphisms. 
Now there exists by lemma 5.24, a Boral map b, of G into M such that 
(i) m; is the restriction of b; to Go, (ii) bi(gh) =b,(g)m,(h) for all (g,h) E Gx Go 
and (iii) f,°(9,9’) =6,(g9’)b,(g’)~? for all (g,g')}EGxG. Let ke M be such 
that 


(45) m(h) = km,(h)k-? 
for all h e Go. From (45) we obtain the equation 


bo(gh)kb,(gh)-* = bo(g)kb,(g)~* 


for all (g,h) € Gx Gy. This shows that the Borel map g —> 6,(g)kb,(g)~} is 
constant on the left G,-cosets. Therefore there exists a Borel map 
b(x —> b(x)) of X into M with the property that 


(46) 62(9)kb,(g)~* = 6(B(g)) = 6°(9) 


for all g e G. An easy computation using (46) and the relation of b; to fi 
now gives us the relation 


£2°(9,9') = 5°(99')f1°(9,9')b%(9') t, 
from which we obtain 
folg.z) = 6(g-x)f,(g,%)b(x)~* 
for all (g,xz)e Gx X. 


(c) > (a). We assume fı ~f. Therefore there exists a Borel map b of 
X into M such that 


folg.v) = blg- x) filg x)b(x) 
for almost all (g,x) e Gx X. This implies that 
(47) f2°(9:9') = 6°(99') f1°(g,9')0°(9') -* 


for almost all (g,9') e Gx G. Let b; be the Borel map of G into M such that 
(i) m; is the restriction of b; to Go, (ii) f,°(g,9')=0;(gg')b,(g’)~? for all 
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(9,9')eGxG@ (lemma 5.24). We then obtain from (47), after an easy 
calculation, the equation 


(48) 61(99') ~*b°(gg') = *ba(gg’) = 51(g')~ *8°(g') ~ *B2(9’) 


for almost all (g,g’)eGxG. This means (corollary 5.21 with X =G) that 
the function g —> b,(g)~1b°(g)~4b.(g) is almost everywhere equal to a 
constant, say k~t e M. Thus 


(49) bo(g) = 6°(g)by(g)k-* 


for almost all g e Œ. We now conclude from the connection between b, 
and m, that for each he Gp, 


b°(g)bi(g)k- malh) = 6°(g)bs(g)my(h)k-? 
for almost all g. Thus 
(50) Mlh) = km,(h)k7}. 
This proves that (c) = (a) and completes the proof. 


Lemma 5.26. If f is any (G,X)-cocycle, there exists a strict (G,X)-cocycle 
fı such that 


f (9,2) = fi(9,%) 


for almost all (g,4)EGxX; fı is determined uniquely up to strict 
cohomology. 


Proof. Let f be a (G,X)-cocycle and let f° be defined as usual, i.e., 
f°(9.9') = f(9:B(9’)). 
Then 
£°(9192:93) = (91,9293) f°(92:93) 
for almost all (9,,92,93) E Gx Gx G. Hence there exists a g, € Q such that 
S919290) = £°(91:9290)f°(G2:9o) 


for almost all (g,,9.) €GxG. If we put g=9,, 9’ =gogo, we can conclude 
that 


£°(9,9') = SII Jogo) fg Jogo)? 
for almost all (g,g') e Gx G. Let 
dolg) = J°(9go™ *,9o)- 
Then do is a Borel map of G into M and 


f°(9,9') = dolgg')dolg') ~? 
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for almost all (g,g’) c G x G. Changing d,(g) to d(g) =d)(g)do(e)~ 1, we secure 
d(e)=1 and at the same time 


(51) figg) = d(gg')d(g') ~? 
for almost all (gg) EG@GXxG. 
Fix now an arbitrary h e Gy. Then it follows from (51) that 


d(gg'h)d(g’'h)~* = d(gg’)d(g’)~* 
for almost all (g,g’) Ee G x G, i.e., 
d(gg')~*d(gg'h) = d(g’)~*d(g'h) 
for almost all (g,g')eG x G. Corollary 5.21 (with X =Q) implies now that 


the function g’ — d(g’)~!d(g'h) is almost everywhere equal to a constant, 
say m(h). In other words, 


(52) d(gh) = d(g)m(h) 


for almost all g. Clearly m(h) is uniquely determined by the requirement 
that the condition (52) be satisfied for almost all g. It now follows from 
(52) and the uniqueness of m(h) that (i) m(e)=1, and (ii) for h,, hz E Go, 
m(hyh2)=m(h,)m(hq). We assert that mis a Borel map of G into M. Since 
M is standard, we may assume, in this argument, that M is the unit 
interval. Let o(g,h)=d(g)~*d(gh). Then p is Borel on Gx Gy. Choose a 
quasi-invariant measure À on G such that A(G)=1. Then 


m(h) = f p(g,h)dA(g). 


As g is Borel on G x Go, this shows that m is Borel on Go. In other words, m 
is a Borel homomorphism of Go into M. 

By lemma 5.24 there exists a Borel map d’ of G into M such that 
d'(e)=1 and d'(gh)=d'(g)m(h) for all (g,h) E Gx Go. From (52) it now 
follows that for each h € Go, 


d(g)d'(g)~* = d(gh)d'(gh)~* 


for almost all g. By corollary 5.22 we can find a Borel map k of X into M 
such that d(g)=k(B(g))d’(g) for almost all g. Write d,(g)=k(B(g))d’(g). 
Then d,(gh)=d,(g)m(h) for all (g,h) € GxG, and d,(g)=d(g) for almost 
all g. By lemma 5.24, there exists a unique strict (G,X)-cocycle f, such 
that 

f1°(9,9') = dilgg')di(g') ~? 


for all (g,g')e G xG. Since dı =d almost everywhere, the last equation 
implies that f° =f,? almost everywhere on G x G. Thus 


(53) filg.z) = f(9,2) 
for almost all (g,x) e G x X. 
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If fa were another strict cocycle such that (53) is satisfied with f in 
place of f,, then fı =f, almost everywhere on G x X. In particular, fi~ fo. 
From lemma 5.25 we now conclude that fı% f}. This completes the proof. 


Theorem 5.27 (Mackey [2], [6]). Let G be a Icsc group, X a standard 
transitive Borel G-space, and M a standard Borel group, Xo, an arbitrary 
point of X, and Go, the stability group at xo. If y is any cohomology class of 
(G,X,M)-cocycles, the set yq of all strict (G,X,M)-cocycles in y 1s nonempty 
and is a strict cohomology class. The set y~ of the homomorphisms of Go 
defined by the elements of yy at xo 1s an equivalence class of Borel homomorph- 
isms of Gy into M. y — y~ is a one-one correspondence of the set of all 
cohomology classes onto the set of all equivalence classes of Borel homo- 
morphisms of Go into M. If M is abelian, the (G,X,M)-cohomology group is 
isomorphic to the group of all Borel homomorphisms of Go into M. Finally, 
if X is an affine G-space (1.e., Go ts trivial), all cocycles are coboundaries. 


Proof. The proofs of all the statements are contained in lemmas 5.23 
through 5.27. 


Remark. The theorem just proved is the cornerstone of the theory of 
induced representations. It reduces the problem of construction and 
classification of cocycles to the problem of constructing the equivalence 
classes of Borel homomorphisms of G into M. Given any Borel homo- 
morphism of Gy into M, lemma 5.24 tells us the method of constructing 
the strict cocycles which define this homomorphism. This construction 
depends however on the choice of a Borel section for the cosets gG@y. In 
general, there are no canonical choices for these sections. We shall see later 
on how, in at least some special cases, these difficulties can be overcome. 


Remark. In many problems, M is already a topological group with a 
second countable topology. It is easy to indicate a sharpened form of 
theorem 5.27 which can be formulated in this context. We shall say that a 
map u of a topological space A into a topological space B is locally 
bounded if for any compact set ACA, u[K] has compact closure in B. A 
mild sharpening of theorem 5.27 is contained in lemma 5.29. To give this 
we need a preparatory lemma, which is classical. 


Lemma 5.28. Let G be a Icsc group, and M a topological group satisfying 
the second axiom of countability. Suppose that M is given its natural Borel 
structure and that m is a homomorphism of G into M which, as a map G into 
M, is measurable. Then m is continuous. 


Proof. Since we can replace M by the range of m, we shall assume that 
m maps G onto M. Let U be an open subset of M containing 1 and W an 
open set such that 1 e W and WW~-!cU. Since the open sets sW (s e M) 
cover M, there is a countable family {s,, 59,---,8,,---} such that 
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M=\),, SW. Choose g, € G such that m(g,)=s,. If m-1(W)=., then it is 
easily seen that G=\), 9,4. Let u be a left Haar measure. By assumption, 
the sets g,A are all measurable and hence u(A)>0. By a well known result, 
there exists an open set B such that ee BC AA™!. Since WW-!CU, it 
follows that m[B]¢ U. This proves that m is continuous at e. Since m is a 
homomorphism, m is continuous everywhere. 


Lemma 5.29. Let the notation be as in theorem 5.27. Suppose that there is a 
second countable topology on M which converts M into a topological group 
and whose associated Borel structure is the given one. Then, any cohomology 
class y contains a strict cocycle which is locally bounded as a map of Gx X 
into M. Moreover, if there is a continuous section for G/Go, y contains a 
strict cocycle which is a continuous map of Gx X into M. 


Proof. It is enough to prove that given a Borel homomorphism m of Go 
into M, there are strict cocycles defining m at x, and having the required 
properties. Now, m, by lemma 5.28, is continuous. Let c(x—>c(x)) be a 
regular Borel section for G/G, (theorem 5.11) such that c(z,)=e. For 
t e G, h(t)=c(B(t))~*t. Then it follows from the regularity of c that h is a 
locally bounded map from G into Gp. If b(g)=m(A(g)) for all g e G, b is a 
locally bounded map of G into M. g, g' — b(gg')b(g')~1 defines a strict 
(G,X )-cocycle f; f is clearly locally bounded and defines m at x. If c is a 
continuous map of X into G, f is even continuous. 


Remark. The reader would have noticed that the transitivity of the 
Borel G-space X has played a crucial role in the main theorems of this 
section. It is a natural question to ask whether some of these results can 
be generalized to the case when X is not homogeneous. 

Suppose then that X is an arbitrary standard Borel G-space and @ an 
invariant measure class. The measure class will now have to play an 
explicit role since it is not unique unless X is transitive. With respect to a 
given measure class @, the problem of describing the cohomology classes of 
cocycles is then well posed. When there is an orbit G@-x, such that 
X — G- xo is a null set for @, this problem is essentially the problem for the 
transitive Borel G-space X,=G'-x . Such measure classes on X we shall 
call transitive. 

However, not every invariant measure class is transitive. Let us call a 
measure class @ which is invariant, an ergodic measure class if any Borel 
function f, having the property that for each g e G, f(g-x)=f(x) for €- 
almost all x, is constant almost everywhere. Transitive measure classes 
are ergodic but not conversely. When X is not transitive under G, there 
will often exist ergodic invariant measure classes @ which are not transi- 
tive. For any such @ the problem of describing all the cohomology classes 
seems quite difficult. 


MEASURE THEORY ON G-SPACES 183 


If @ is transitive and G acts freely on X, i.e., for -almost all x e X the 
stability subgroups are trivial, i.e., consist only of {e}, then there exists 
obviously an xo € X such that X —G- x, is @-null and G,,, =e. In this case, 
there are no cocycles other than coboundaries. When @ is assumed to be 
merely ergodic, this is not true any longer. We shall now construct 
examples where @ is ergodic, G acts freely on X, and yet there are cocycles 
which are not coboundaries. 


Example 1: G is the discrete additive group of integers, X is the multi- 
plicative group of complex numbers of modulus 1, and M =X. p is the 
Haar measure on X. We denote by z an element of X such that the powers 
Zo" (n=1,2,---) are dense in X. For ne G and ze xX, n[z]=z,"z. The 
measure class of u is ergodic, as may be verified easily. 

Suppose that f is a Borel map of X into M. If we define, for 
(n,z)EGx X, 


f(%)f (20x) - -f (z071) (n > 1), 


r(n,x) = <1 (n = 0), 
J20 >a) -f (z"x) 7} (n < 0), 


then r, is a strict cocycle, as is easily verified, and 
r;(l,£) = f(z). 


It is obvious that every cocycle is equal almost everywhere on G x X to 
some r;. We now ask when r, is a coboundary. Clearly, this is so if and only 
if there exists a Borel map b of X into M such that 


r,(n,2) = b(zo"zx)b(z) ~? 
for -almost all x. In particular, 
b(x) f(x) = b(zox) 


for almost all x. Now, not every f can satisfy this equation for suitable b. 
In fact, we have, for f (x)=x, 


a~ — -KÎ 
6441 = Z207 “Dx, 


where k —> b, is the Fourier transform of b, i.e., 
b, = f b(x)a*dp(x), (k = 0, +1, +2,---). 
x 


This implies that |6,|? is independent of k, and hence that 6, =0 for all 
k since b e ¥?(X,u). This means that b=0, a contradiction. 


Example 2: In this example, G= R and M the multiplicative group of 
complex numbers of modulus 1; it is due to Helson and Lowdenslager [1]. 
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We make frequent use of the Pontrjagin theory for locally compact 
abelian groups (Pontrjagin [1], Kaplansky [2], pp. 54-56). 

We write R for the additive group of reals in the usual topology and 
Rê for the additive group of the reals in its discrete topology. Let D be a 
countable subgroup of Rê and X the multiplicatively written character 
group of D. In the (usual) topology of pointwise convergence, X is a 
compact metrizable group. Since no element of D is of finite order, X is 
connected. For àe D and xe X we write <A,2> for the value of x at A. 
For any te R, À — e^ (Ae D) is an element of X. We write é for this 
element. t > f is a homomorphism of R into X. We note that the image of 
R under this homomorphism is dense in X. Otherwise, by the duality 
there would exist a A440 in D such that 


eitA — ] 


for all te R, which is impossible. If D contains at least two elements 
which are rationally independent, t—>? is injective, as may be easily 
verified. 

The injection t — Ê gives rise to an action of R on X if we set 


t-x = tx ((eRh,xeEX). 


It is obvious that t, x —>t-x is a continuous map of Rx X into X and 
that X becomes an R-space. The Haar measure p on X is invariant and 
ergodic under this action. Our object is to exhibit (R,X,M )-cocycles which 
are not coboundaries. 

Suppose àe D. Then the function p, : t, æ —> <A,2>(1—e!##4) is con- 
tinuous on Rx X and p, (¢+u,x2) =, (t,u-x)+y,(u,x) for all t, we R and 
x € X. However, p, is not real valued. Hence we pass on to p}, where 


prlt,x) = <A,x>(1—el) +<—A,x>(1—e-"). 


Then o, is real and exp(ig,) is easily verified to be a strict (R,X,M)- 
cocycle. This cocycle, and hence any finite product of such, is obviously a 
coboundary. However, things change quite a bit when we take infinite 
products of the cocycles exp(ip,). We shall show that many of these 
infinite products, when convergent, represent cocycles which are not 
coboundaries. 

Let us now assume that c,,co,--- and Aj, Ag,--+ are sequences of 
positive numbers such that 


(a) ¢;—> +00, 
(b) A,;ED and the à, are rationally independent, 


(c) > AG < o. 
j=1 
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Put co=0, c;=c_,;, A_;=—A,. Then the series X3 o ¢<A;,%>(1— e^) 
converges uniformly when t varies over a bounded set and x varies over X. 
It therefore represents a real valued function ọ continuous on Rx X. As 


œo 
p = > CiP 
j=1 


p =exp(ig) is a strict cocycle. We shall prove that 4 is not a coboundary. 
Suppose 7 is a coboundary. Then there exists a Borel map b of X into M 
such that for 


(*) pltw) = b(tar)b(x)-* 


for almost all (t,x)e RxX. Write a, for the map x —> (t,x), and for 
y E€ X, let a, be the map x — b(yx)b(x)-!. Then y —> a, is a continuous 
map of X into the Hilbert space X?(u). This implies that (*) is valid for 
each ¢ for almost all x. 

Suppose now that y e X is such that <à yy =1 for all j; then we claim that 
a, is a constant (of absolute value 1) for almost all x. To see this, let us 
consider, for fixed s e R and each y € X, the function c,, where 


cy(x) = exp{ (= > KEN yap) (=e) h- 
j=- o 

Since s is fixed, the series under the exponential is uniformly convergent 

as x,y vary over X and hence x, y —> c,(x) is a continuous function on 

X x X. Hence y — c, is a continuous map of X into ¥?(u). But if we put 

y=t forte R, an easy calculation shows that 


ci(x) = p(t,s-x)p(t,x) 7! 


= a;(s-x)a;(x)~*. 


The continuity of the maps y —> c, and y —> a, shows that for each y € X, 
we must have 


c,(£) = a(s- z)a(2)~? 


for almost all x. If now ye X and <A,,y>=1 for all j, c,(z)=1 and hence 
a,(s-x)=a,(x) for almost all z. As R acts ergodically on X, a, must be a 
constant almost everywhere. This constant is obviously of modulus 1. 

The conclusion of the preceding paragraph can be sharpened. Suppose 
ye X is such that <à yy =1 for j#k and <A,,y> = —1(k, j2 1). Then, the 
result of the previous paragraph, applied to the strict cocycle obtained by 
dropping the terms with indices + k, shows that for some complex number 
z of modulus 1, 


a(x) = z exp{4ic, Re <p £Y} 
for almost all x. 
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We are now in a position to give the decisive argument. Since the A, 


are rationally independent, there exists, for each kK=1,2,--- a ype X 
such that 
l1 j+k, (jf 2 1), 
awos > 7 
— j=k. 


Write a,’ for a,,. Since X is compact, there exists a subsequence {y, } 
and a Yo € X such that Yk, > yo. Then ap, —> a,,. AS (A;,Yo> =1 for all J, 
we know that a,,(x)=7 for almost all x, r being a constant with |r|=1. 
Hence 


Í ak, dp —> T. 
X 


On the other hand, we claim that, since cy —> +00, 


Í ay dp —> 0. 
x 
In fact, consider the integral 


Jo, = Í exp{4ic Red), }du(x) 


for c>0. If A(z) #1 for some x € X, then x —> ¢A,x7> maps X onto a com- 
pact nontrivial subgroup of M, which is connected as X is connected, and 
so coincides with M. The measure u then goes over to Haar measure on 
M and hence 


Jo. = f exp{4ic Re m}dm 
M 


1 (7 . 
= 5 f exp(4ic cos 0)d 6. 
Je a is thus independent of À and —> 0 as c—> œ. This proves, as we 


asserted, that 
f ay du —> 0. 
X 


This contradiction shows that our cocycle ¢ is not a coboundary. 

We shall end this section with an application to the theory of quasi- 
invariant o-finite measures on standard transitive Borel G-spaces. For a 
direct derivation of the results which are about to be obtained now, the 
reader may consult Mackey’s paper [3]. We write R , for the multiplicative 
group of positive reals. 

Let G be a lesc group, Go a closed subgroup, X=G/G) and £ the 
canonical map of G onto X. Let B(e)=2,. Suppose that F is a positive 
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Borel function on G x X and a is a quasi-invariant o-finite measure on X. 
We write 


(54) Fura 
if for each g e G, x > F(g,x) is a version of da/da9~”. 


Theorem 5.30. Let A and A, be the modular functions of G and Go, 
respectively. Write 


(55) mh) = A(h)Ag(h)-* — (hE Go). 


Then, for any quasi-invariant o-finite measure a, there exists a strict 
(G,X,R,)-cocycle r such that r~a. Any such r defines the homomorphism m 
at xo. Conversely, if r is a strict cocycle defining m at £o, r~a for some quasi- 
invariant o-finite measure a; r determines a up to a constant multiplying 
factor. Finally, an invariant o-finite measure on X is necessarily a Borel 
measure, and there exists one such on X if and only if A(h)=A,(h) for all 
he Gp. 


Proof. We begin our proof with a simple observation. Let SCG be a 
subgroup such that G—S is measurable and has measure zero. Then 
S=G. In fact, if SAG, and ge G—S, gS<G—S so that gS has measure 
zero. This implies that S is a null set, a contradiction. 

This said, let us consider a quasi-invariant o-finite measure a on X. 
Let F be a positive Borel function on Gx X such that for almost all g, 
a> F(g,x) is a version of da/da%-» (theorem 5.10). Equation (22) shows 
that F is a (G,X,R,)-cocycle. Hence, by lemma 5.26 there exists a strict 
cocycle r such that r= F almost everywhere on G xX. For each ge G, 
let f, be a version of da/da@~» and let 


S = {g : r(g,x) = f,(x) for almost all x}. 


Since the f, satisfy (20) and r satisfies (36), it is easily seen that S is a 
subgroup of G. Further, G—S has measure zero. Therefore S=G. In other 
words, r~a. 

If r’ is another strict cocycle such that r’~a, then r’=r almost every- 
where on Gx X. Hence r’xr by lemma 5.25. Thus, R, being abelian, 
r’ defines the same homomorphism of Go as r does, at xo. Let m, denote this 
homomorphism. We claim that m, is independent of «. Suppose that a’ is 
another o-finite quasi-invariant measure. Let a be a positive Borel function 
which is a version of da’/da. Then it is quickly verified that (with r~a) 


(56) r : g, x —> a(x)r(g,x)a(g-x)~} 


is a strict cocycle such that 7r’~a’. But then, r' xr, so that my =m, Let 
mo denote this homomorphism of Go, determined by all the measures «. 
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Suppose now that r is a strict cocycle defining the homomorphism m, 
at 2. Let œ, be a o-finite quasi-invariant measure and r, a strict cocycle 
such that r, ~«,. Since 7, also defines m, at £o, 7, 27. Hence there exists a 
Borel map b of X into R, such that 


(57) r(g,x) = B(x)r,(g,2)b(g-x)~* 


for all (g,x) e Gx X. If a is the o-finite quasi-invariant measure such that 
da/da,=b, then (57) shows that r~a. If «’ is another o-finite quasi- 
invariant measure such that r~ a’ also, then a’ is a constant multiple of a. 
In fact, let a be a positive Borel function such that a is a version of 
da'/da. Then for each g, a(x) =a(g-x) for almost all x, showing that a is a 
constant almost everywhere. 

Suppose now that X admits an invariant o-finite measure, say y. Then 
the strict cocycle 1 : g, x — 1 is such that 1~y. Since 1 defines the trivial 
homomorphism at xo, mp is trivial. Conversely, let mọ be trivial and let « 
be a o-finite quasi-invariant measure. Let r be a strict cocycle defining 
My at x such that r~a. Since mo is trivial, ral, and so there exists a 
Borel map b of X into R, such that 


(58) r(g,x) = b(g-x)b(x)~* 


for all (g,x) e Gx X. The equation (58) shows that the o-finite measure y 
defined by the equation 


dy/da = b 


is invariant. In other words, X admits a o-finite invariant measure if and 
only if mo is trivial. 

It remains to prove that m)>=m. To this end we use the results and 
notations of lemmas 5.14 and 5.15 and theorem 5.19. Using the argument 
of lemma 5.29, we construct a Borel map d of G into R,, such that 
(i) d(e)=1 and dis a locally bounded map of G into R, (ii) there exists a 
strict cocycle r such that r°(g,g')=d(gg’)d(g’)~* for all (g,g') e Gx G, and 
(iii) r defines the homomorphism m of Gy at 29. Let y be the Borel measure 
on G defined by the linear functional (also written y) 


y:f> f fig)d(g)-dulg), (FECA) 


p being a left Haar measure on G. We claim that y vanishes on the kernel 
of the map f —> f~. In fact, suppose f € C,(@) is such that 


; f(gh)duo(h) = 0 
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for all g in G. Multiplying by u-d-+ where u e C,(G) and integrating with 
respect to u, we get 


J, (a f ghydpo (h) Julo)d(g) = dug) = 0. 


We interchange the order of integration and change g to gh while integrat- 
ing with respect to u. We then obtain, on using the tact that d defines m 


at Xo, 
f (A00 |, Wgh -DA Ang) dolh = 0. 


If we now change the order of integration once again we obtain 


(59) f u~ (B19))d(g) -f (g)dulg) = 0. 


Since the map u —— u~ maps C,(@) onto C.(X), we can choose u so that 
~ is identically 1 on the set [K], where K is a compact subset of G out- 
side of which f is zero. Then (59) leads to the equation 


(60) fr f(g)d(g)~2dulg) = 0. 


In other words, there exists a linear functional y~ on C,(X) such that 
y(f)=y~(f~) for all feC(G). Moreover, we know from lemma 5.15 that 
given any véC,(X), which is real and >0, there is a weEC,(G) such 
that u~=v and u>0. This shows that y~ is a nonnegative linear 
functional. Therefore there exists a Borel measure a on X, such that y~ 
corresponds to «œ. Then, obviously y=a°. Since y is quasi invariant, so is 
a (corollary 5.17). Fix geG. We shall compute da/da—», Let t be a positive 
Borel version of it on X. Then, by corollary 5.17, t° is a version of dy/dyo-»), 
But, since dy/du=d-!, we obtain 


t°(9’) = d(gg’)d(g’)-* = r°(9,9') 


for almost all g’. In other words, for each g € G, x > r(g,x) is a version of 
dalda”. This proves that r~a. By our definition of my, m= mp. 

The above argument shows actually that if r is a locally bounded strict 
cocycle defining m, there exists a quasi-invariant Borel measure «œ such 
that r~a. Suppose now that X admits an invariant o-finite measure a. 
m is then trivial and our remark at the beginning of this paragraph shows 
that there exists a Borel measure y such that 1~y. But this implies that « 
is a constant multiple of y by corollary 5.21. a is thus a Borel measure. 
The proof of the theorem is complete. 


Corollary 5.31. If G is unimodular, G/Gy admits an invariant Borel 
measure if and only if Gy is also unimodular. If G is arbitrary but Go is 
compact, then G/G, admits an invariant Borel measure. 
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We end this section with an example of a unimodular G and a non- 
unimodular Gp. Let G=SL/(2,C), the group of 2x2 matrices g 


a b 
(61) g = ( ) (ad —be = 1) 
c d 


of determinant 1. g >a, b, c, d gives an injective homeomorphism of G 
into C*. The group G, equipped with the relative topology obtained by 
imbedding it as above in (+, is a lesc group. Let Go be the subgroup of all 
elements ta ,, where 


(62) tas = (0 í ) (a,ze C, a #0). 

' 0 a7} 
Go is closed. We claim that G is unimodular and G, is not, so that X =G/G) 
does not admit any o-finite invariant measure. 

For the unimodularity of G we shall be more general and prove that G 
admits no nontrivial continuous homomorphism into R,. Suppose that 
m is one such. Since m(x)=m(yxy~*) for all x, y in G, it follows that for 
any matrix x in G with distinct eigenvalues, m(x)=m/(y) for all matrices y 
with the same eigenvalues as x. Let G” be the set of all matrices x in G 
with distinct eigenvalues and H the subgroup of diagonal matrices. Then, 
the restriction of m to H is a continuous homomorphism of H into R, 
with the property that m(x)=m/(y) whenever the diagonal entries of x and 
y are permutations of one another. It is easy to deduce from this that m 
is identically 1 on H. Since for any g e @ there is an aé€G@ such that 
aga~* e H, m(g)=1 for all g e G’. Now G’ is a dense subset of G and so m 
is identically 1 on G. In particular, G is unimodular. 

Go is not unimodular. To see this, let 


(63) H = {t, 9:46 0%, 


Then H is an abelian closed subgroup of G and is hence unimodular. If 
Gy were unimodular, then there would exist an invariant Borel measure 
on X,)=G,/H. Now in each left H-coset in Go there exists exactly one 
element of the form ¢, , (z € C); if we use the complex number z to param- 
etrize the coset t, „H, the action of Go on X, becomes 


(64) to. : E —> C+ cez. 


It is obvious that the space of complex numbers does not admit any 
Borel measure invariant under the maps (64). Thus G is not unimodular. 

It is of interest to note in this case that X =G/G, is actually compact. 
In fact, if K denotes the subgroup of unitary matrices in G, it is an easy 
consequence of the existence of the Jordan canonical form of an arbitrary 
matrix in G, that G= KG. Hence X is the image of K under the canonical 
mapping of G. In particular, X is compact. 
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Our main concern in this section is to examine how the topology 
of a lcsc group can be recovered from the measure-theoretic structure. 
Suppose that G is a lesc group and that p is a left Haar measure. Then it is 
classical that for any Borel set E with (E) >0, the set EE-? contains a 
neighborhood of e. This result hints that the topology of G is completely 
determined by u. That this is indeed so was proved by Weil [1] (cf. Halmos 
[1], pp. 257-289 for a detailed discussion of Weil’s results). For our 
purposes, this result of Weil is somewhat inadequate. The main theorem of 
this section is a sharpening of Weil’s theorem, first proved by Mackey [5]. 
We shall use this sharpened form of Weil’s result in a significant way in 
Chapter VII. 

This sharpening essentially comes out of the following observations. The 
first point is that starting with only a left invariant measure class on a 
separable Borel group G one can show that the group possesses a left 
invariant measure. The second point is that the Weil topology for G is 
second countable and generates the original Borel structure on G. The 
third observation now consists in imbedding G as a dense subgroup of a 
lese group G*. The fact that G is standard now implies that G is a Borel 
set in G*. This leads easily to the identity G=G*. In other words, every 
standard Borel group admitting a left invariant measure class is actually 
a lesc group under its Weil topology. This is the sharpening of Weil’s 
theorem we referred to. It is due to Mackey [5]. 

Since the main aim of our analysis is to obtain topological results from 
measure-theoretic assumptions, it is convenient to start with groups 
without any topology. Throughout this section, G will denote a separable 
Borel group, i.e., a group G with a Borel structure which converts G into a 
separable Borel space such that the map 2, y > zy~+ of Gx G into G is 
Borel. Since the left translations x —> zz are Borel automorphisms, we 
may consider measure classes invariant under these. We call them left 
invariant. In an analogous fashion we define right invariant measure classes. 
A measure class which is both left and right invariant is called bt-invariant. 


Lemma 5.32. Let G be a separable Borel group on which there exists a left 
(or right) invariant measure class. Then G admits a bt-invariant measure class. 


Proof. Let € be a left invariant measure class and A a finite measure in 
g. We shall prove first that for any Borel set ESG, s > A(Es) is a Borel 
function of s. In fact, the map T : s, t— s, ts isa Borel automorphism of the 
Borel space G x G and hence T[G x E] is a Borel set in Gx G. It is then 
obvious that 


Es = {t : (s,t)e T[G x EJ}, 


and hence the function s — A(Es) is Borel by Fubini’s theorem. 
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Define the measure A, by 
(65) à (E) = Í A(Es)dà(s). 
G 


A, is easily seen to be a measure and A,(G)=X(G)?>0. We claim that the 
measure class determined by À, is bi-invariant. It is enough to prove that 
if A,(£)=0, then A,(#z)=A,(z#) =0 for all z € G. Suppose A,(#)=0. Then, 
as A(Es) > 0, (E's) =0 for A-almost all s. Since Ae @ and @ is left invariant 
\(zH's) =0 for A-almost all s, showing that A,(z#)=0. On the other hand, if 
N is a Borel set such that A(N)=0 and A(Hs)=0 for seG—N, then 
A(z~1N)=0 and A(Ezs)=0 for se G—z-'N. This proves that A,(#z)=0 
also. A similar argument can be given starting with a right invariant 
measure class. The lemma is proved. 


Lemma 5.33. Let G be a separable Borel group and let @ be a bi-invariant 
measure class. Then © contains left invariant and right invariant o-finite 
measures. 


Proof. Let À be a finite measure in @, such that A(G)=1. Define à; by 
àS(E) = A(s +E). 


Then the measures {à} are all mutually absolutely continuous. Since 
s->À'(E) is a Borel function on G, corollary 5.6 is applicable and therefore 
there exists a Borel function r on GxG such that for A-almost all s, 
t —> r(s,t) is a version of dA/dA*~”. Since àt and A are mutually absolutely 
continuous, 7(s,t)>0 for almost all ¢ for A-almost all s. Hence r is positive 
(A x A)-almost everywhere on G x G. We may thus assume that r is positive — 
on GxG. 

The arguments of theorem 5.10 can now be used to conclude that for 
(Ax Ax A)-almost all (s,f,u) Ee Gx Gx G, 


r(st,u) = r(s,tu)r(t,u). 
Then there exists a positive Borel function b on G such that 
(66) r(s,t) = b(st)/b(t) 


for (A x A)-almost all (s,t) e Gx G. In fact, all we have to do is to imitate 
the argument of the first part of the proof of lemma 5.26; the only fact 
used there is the bi-invariance of the class of null sets involved which is 
certainly satisfied in our present setup. The measure u such that du/dA=b 
is then o-finite and belongs to @. From (66) we conclude that for A-almost 
all s, du/du~ is equal to 1 almost everywhere. Thus p=" for A-almost 
all s. Now the set S of all se G such that =p" is a subgroup. Since 
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this subgroup has a complement of A-measure zero, it must coincide with G 
(cf. remark in proof of theorem 5.30). This proves that u is left invariant. 
A similar argument yields a right invariant measure in the measure class 
@. The proof is complete. 


Let now # be a separable Hilbert space and X the group of all unitary 
operators on # We shall give the strong topology for Y; i.e., the smallest 
topology which makes all the maps U —> Uf (fe #) continuous. If 
{U,} is a sequence of elements in V, U, > V e Y if and only if U,f— Vf 
for all f belonging to some set D such that the linear combinations of 
vectors of D are dense in % 


Lemma 5.34. Y is a metrizable topological group satisfying the second 
axiom of countabtlity and its Borel structure is standard. 


Proof. Since # is separable, there exists a countable set D dense in %, 
and the topology for WY is clearly the smallest one which makes all the 
maps U —> Uf (f e D) continuous. As there are only denumerably many 
maps, it follows from standard topological arguments that the topology 
for Y is metrizable and second countable. To prove that X is a topological 
group we must show that the map U, V + UV ~? is continuous. But, as 
the elements of Y are unitary, we have, for any fe % 


JUV -*f—UgVo fil s|VVof)—Vo(Vo Mf) + U(Vo7 Ff) — Uolo tS). 


This inequality proves at once that X is a topological group. 

To prove that the associated Borel structure of X is standard, we must 
exhibit a Borel isomorphism of @ with a Borel space which is known to be 
standard. Let X be the topological product of countably many copies of 
H and let D={f,, fo,---} be a countable orthonormal basis for # For 
each Ue Y, let t(U) be the element (Ufi, Ufas») of X; 


(67) g : U —> (Ufi, Ufa»). 


¢ is clearly a homeomorphism of YW into X. We want to show that the 
range of ¢ is a Borel set in X. Clearly, (z,, 22,---) e X lies in the range of ¢ 
if and only if {z,, z.,---} is an orthonormal basis, i.e., if and only if 


<Zmsen> = Om, (Kronecker delta) 


68 
(68) > Kzmfol = 1  foralli. 

These are countably many equations and each one of them defines a Borel 
subset of X. Thus the range of ¢ is Borel. ¢ is thus an isomorphism of @ 
with the Borel subspace of X defined by (68). As this is a Borel subset of 
the standard space X, we can conclude that X is standard. 
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Let now G be a separable Borel group and let u be a left invariant 
o-finite measure on G. Let #, be the Hilbert space of complex valued 
(equivalence classes of) Borel functions f on G such that 


(69) Ifi? = f Lf (g)|2dulg) < 00. 


For any g e G, let L, be the operator of #, defined by 
(70) (LfN = f) (eEG). 


L, is clearly a unitary operator of #,. We have 


(71) L= 1, the identity operator, 
Laos = Lo Lo, for all 91, 92 E G. 


Denote by Y, the group of all unitary operators on %,. 


Lemma 5.35. Let G be a separable Borel group which has a left invariant 
o-finite measure u. Then the map L(g — L,) of G into Y, is a one-one Borel 
map. If G is standard, its range LIG] is a Borel set in Y, and L is a Borel 
isomorphism of G with L{G]. 


Proof. In view of the definition of the Borel structure on Y,, we must, 
in order to show that L is Borel, prove that for each fe %,, g > L,f is a 
Borel map of G into #%,. Since #, is separable, we can use lemma 5.4 to 
reduce this to showing that for each f, f’ € Æ g > <L,f,f'> is a Borel 


function on G. But (x denotes complex conjugation) 


CLS, f’ = f f- 0s" (t)*dult), 


which is obviously Borel, as g, t > f(g~4t)f’(t)* is a Borel function on 
GxG. 

We show next that L is one-one. As L is a homomorphism of G into 
U,, this reduces to showing that if g4e, L,fAf for some f e #,. Suppose 
that L,f=f for all fe Æ, Since the Borel structure of G is countably 
generated and p is o-finite, there exists a sequence {H,} of sets of finite 
p-measure such that {#,} generates the Borel structure of G. Let fa = xg, 
Then f, E€ X, and the sequence {f,} separates the points of G (since each 
single point set of G is a Borel set). Let N be a set of -measure zero such 
that for xN, 


(72) Pilg 7 1x) = fala) 


for all n. The fact that {f,} is a separating sequence now implies that 
g~*x=z for all x e G—N. The truth of this for even one z e @G—N implies 
g =e. L is thus one-one. The remaining statements follow from the fact 
that V, is standard. 
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We shall now introduce the Weil topology. Let G be a separable Borel 
group admitting a left invariant o-finite measure u. Consider for each 
g € G, the unitary operator L, in #,. The Weil topology for G is then defined 
to be the smallest one which makes the maps 


(73) g —> Lf 


of G into #, continuous for all fe #,. In view of the definition of the 
strong topology for Y, it is clear that the Weil topology is that topology 
for G which makes the map g —> L, of G into Y, a homeomorphism. Thus 
G, under the Weil topology, is a topological group satisfying the second 
axiom of countability. 

Let # and F be Borel sets of finite measure. We write 


(74) p(E,F) = w((E—F) U (F—B)). 
Then from the relations 


p(B, F) = f Ixa(t) —xe(2)|2du(t), 
G 
p(B, F) = p(vB,xF), 


it is easy to deduce that x, —> x in the Weil topology for G if and only if 
p(x, ,xE) — 0 for all Borel sets E of finite measure; it is even sufficient to 
restrict the E’s to a Boolean ring of sets of finite measure which generate 
the Borel structure of G. 


Lemma 5.36. If G is a standard Borel group and n a left invariant o-finite 
measure on it, the Borel structure generated by the sets which are open in the 
Weil topology coincides with the original Borel structure. Moreover, if G is 
a lesc group, the Weil topology coincides with the original topology. 


Proof. Let G, be the range of L(g — L,). Let @* be the Borel structure 
for G generated by the Weil topology and let Z be the original Borel 
structure. Then, L being a homeomorphism of G onto G, L is a Borel 
isomorphism of (G,4*) with the Borel subspace of Y, defined by G,. By 
lemma 5.35, L is also a Borel isomorphism of (G,¥%), with the Borel 
subspace of %, defined by G,. Hence @=2*. 

Let us now assume that G is a lese group. We want to prove that its 
topology coincides with the Weil topology. Since both topologies are 
metrizable, it is obviously sufficient to prove that for any sequence {gp} 
in G, ga —> e in the given topology of G if and only if p(g,#,H) —> 0 for each 
Borel set E of finite measure. All topological statements, in the argument 
to follow, refer to the given topology on G. Let then g, — e. If {V,} is a 
decreasing sequence of compact symmetric (V,= V,~*) neighborhoods of e 
which form a base for G at e, then for any compact set K, it is easy to see 
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that (ìs (V,AK)=K. Fix a Borel set E of finite measure, and let « > 0. 
Choose a compact KSE so that u(E—K)<e and choose s so that 
u(V,K —K)<e. If no is chosen so that g e V, for n>, then for all 
such n, 

pGnL,L) < p(g,K,K)+2p(L,K) 
2u( VK —K)+2¢e 
te. 


In the converse direction, suppose g, +> e. By passing to a suitable sub- 
sequence we may assume that there exists a compact neighborhood V of 
the identity such that g, ¢ V for all n. Choose a compact neighborhood W 
ofesuchthat WW-'cV.Theng,W A W= @ forall n so that p(g,W,W)= 
2u(W)>0. This shows that g, cannot converge to e in the Weil topology. 
This completes the proof of the lemma. 


lA 


IA 


Consider now a standard Borel group G with a nonzero left invariant 
Borel measure u. We equip G with the Weil topology. Following standard 
terminology, we shall say that a subset ACG is bounded if for each open 
set U in G one can find elements z,,---, x, E G such that 


Ac ÙU 4U; 
i=4 


It is obviously enough to require this only for open sets containing e. 
We shall say that G is locally bounded if some open set containing e is 


bounded. Notice that open sets in G are Borel (lemma 5.36); if U is open 
and nonempty, there are elements z,, X2,---, in G such that G=\), 2,U, 
from which it follows at once that 0 <p(U)<oo. 


Lemma 5.37. G is locally bounded. Every bounded Borel set is of finite 
measure. 

Proof. Before we take up the proof of this lemma we make a simple 
observation. Let Æ be a Borel set with E = E-t and 0 < (E) < oo. Then there 
is a Borel subset FSE with F=f} and 0<p(F)<oo. In fact, since u 
is o-finite, there is a Borel function f on E with f(x)>0 for all xe E and 
Í pfdu<œ; replacing f(x) by min(f(x),f(z~*)) we may assume that 
f(x)=f(x7+) for all x in Æ; it then suffices to take F as {x : f(x) >c} for 
some c >Q. 

This said, we come to the proof of lemma 5.37. Let E be a Borel set such 
that E=ÆE-?! and 0<p(E#)<oo. Put, for 0< e< ?2u(E), 


N(E : e) = {x : p(zE,E) < e}. 


Since x —> p(xE,E) is a continuous function on G, vanishing at e, N(E : e) 
is an open set containing e. We claim that 


0 < p(N(E :€)) < œ. 
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We need only to prove that u(N(E : e))< œ. Now, we have’ 
ff xelxelt*e)du(erdu(e) = fx sO a(t) dy (t 
GxG 
= w(B)?. 
On the other hand, as H= £7}, 


f xel0xs( “a )du(t)dula) = {{ Xel(t)xe(x~ “t)du(t)du(z) 


GxG GxG 


u(xE A E)dp(zx). 


Ii 
at, 


Therefore, 


pH)? > (zE A E)dp(x). 


But, if xe N(E : e), 


p(xE,E) = 2(E)—2u(xE N^ E) < e 
so that 
(E) > (m(E)-—e]2)u(N(E : e)). 


This proves our claim that 0<p(N(E : e))}< œ. 


At this stage, we know that sufficiently small open subsets of G are of 
positive finite measure. Let U be any open set containing 1 with 0< 
p(U)<oo and V an open subset of U such that ee V and VV-!cU. We 
claim that V is bounded. Otherwise there would exist an open set W 
containing e and an infinite sequence 7, v,---, of elements of V such that 


Ensi E U W (n = l, 2). 


Choose an open set L containing e such that LL-!<cW NA V. Then it is 
easy to see that the sets x, L, x, L,--+ are disjoint and contained in V V71. 
Since V V -tc U, 0< p(U)< œ, and since (L) > 0, we have a contradiction. 
The last statement is easy. For, let A be a bounded Borel set and W an 
open set of finite measure. If z,,---, 7, € G are such that ASUR- 2,W, 
then p(A) <np(W) <oo. 

We shall now use these results to imbed G as a dense subgroup of a lese 
group G*, Our argument is based on the observation that G admits a left 
invariant metric metrizing its Weil topology; in fact, if {#,} is a denumer- 
able Boolean ring of Borel sets of finite measure which generate the Borel 
structure of G, the metric 


B plz” n YEr) 3-n 
ie = Drege” 


is easily verified to induce the Weil topology; 6 is left invariant. 
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Lemma 5.38. Let G be a second countable topological group and let it satisfy 
the two conditions : 


(a) there is a left invariant metric ô for the topology of G, 
(b) af {x,} ts a -Cauchy sequence, then {x,~*} contains a 6-Cauchy 
subsequence. 


(76) 


Let G* be the completion of the metric space (G6) with G considered as 
imbedded in G*. Then there exists exactly one way to convert G* into a 
topological group such that G becomes a dense topological subgroup of it. 


Proof. AS 8(%_;%m)=8(%,~1X_,e), it follows that {x,} is a 6-Cauchy 
sequence if and only if z,~ 12, > e as n, m —> oo. Let G* be the completion 
of G (under ô); we regard G as imbedded in G*. We denote also by 6 the 
extension of the metric 6 to G*, which is a second countable topological 
space in its metric topology. 

We shall now show that if {a,} and {y,} are Cauchy sequences in G, so 
is {£ Yn}. Let V be any open neighborhood of e and let us choose an open 
neighborhood W of e such that W°<V. Let k be an integer such that 
Yn Ym € W for n,m>k. Choose an open neighborhood W’ of e such 
that WSW and y` W'y a W. Finally, select an integer k,>k such 
that 2, ~ t£ E W’ for n, m> k. Then, if n, m> k, 


Yn En Lm Ym = (Yn Y Ye (En Em)Yk Yk Ym 
Ee W(y,71W’y,)W 
< V. 


This proves that {z, Yn} is Cauchy. From this it follows easily that there is a 
unique continuous map p* of G* x G* into G* such that p*(x,y)=xy 
(x,y € G). We define, for z,y € G*, 


(77) xy = p*(x,y). 


This product operation is clearly associative and xe = ex =~ for all x e G*. If 
x Ee G* and {x,} is any sequence in G converging to x, then, for any limit 
point x’ of the sequence {z,~'} we have xx’ =x'x=e. Condition (76b) now 
implies that x-1 exists and that x, 1 converges to 2~+. The rest of lemma 
5.38 now follows trivially. 


Corollary 5.39. Let G be a locally bounded second countable topological 
group whose topology is induced by a left invariant metric 5. Then G satisfies 
(b) of (76). In this case G* is a lesc group and for any compact set K of G*, 
K NA G* is a bounded set in G. 


Proof. Suppose V is a bounded set in G. We claim that any sequence in 
V has a Cauchy subsequence. To prove this we proceed as follows. Let 
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{Un} be a sequence of open subsets of G forming a base for the topology 
of G at e such that U7},U,,,¢U,, for all n. If {x,} is a sequence in V 
having infinitely many elements, it follows from the boundedness of V 
that, for each k, some translate a,U,, contains infinitely many of the z, 
This means that for some infinite set Z, of the integers, 7, £m E U,,~*U,¢ 
U,_, whenever n,m e Z,. By choosing the Z’s to decrease with k and 
using the Cantor diagonal procedure we may assure ourselves that {z,} 
has a subsequence which is Cauchy. 

We shall now prove that if {x,} is any Cauchy sequence, {x,~+} has 
a Cauchy subsequence. It is enough to exhibit a bounded set containing 
all but finitely many of the elements x,~*. Let U be a bounded neighbor- 
hood of e. Now there is a k such that z,“ tze U for n>k. Hence 
z, tE Ux,” for all n > k; as Uxt is bounded, our assertion is proved. G* 
can thus be regarded as a second countable topological group and G a 
dense subgroup. To prove that G* is locally compact, it is enough to exhibit 
some compact subset of G* with nonnull interior. Let V be any subset of 
G, bounded and open in G and containing e. Since any sequence in V 
contains a Cauchy subsequence, V, the closure of V in G*, is compact. 
Since G is second countable, there is a sequence {x,} in G such that 
G=\|), £, V. We claim that G*=(), 2,(VV). In fact, if y* c G* and {y,} 
is a sequence in G such that y, —> y*, there exists an m such that y,,~1y, E V 
for all n>m. Thus y, € YV for all n>m. Choose an integer k such that 
Ym E Ly V. Then y, €2x,VV for all n>m, showing that y* € x V V. By the 
category theorem of Baire, some x,VV contains an interior point. This 
implies that VV is a compact set with nonempty interior and shows that 
G* is locally compact. 

For the last assertion, let K be compact in G* and U an open set in G 
containing e. Let U* be an open subset of G* such that U=U* AG. 
Select an open set V* in G* such that e e V* and V*V*c U*. There exist 
21*, %o*,---,u,*EG* such that KCl, 2*V*. We choose 2,64 
(l<t<k) such that 2,~}27,*e V*. Then KeU a V*V¥CUE, 2U* 
and so KN GEURE gU. 

We come to our final lemma. Let G be a separable Borel group with a 
left invariant o-finite measure p. 


Lemma 5.40. If G is standard, then G is already locally compact under its 
Weil topology. 


Proof. Under the Weil topology, G is locally bounded and second 
countable. We also know that the Weil topology is induced by a left 
invariant metric. Corollary 5.39 is thus applicable and hence we may 
regard G as a dense subgroup of a lIesc group G*. For any Borel set 
A*<G*, A* ^G is Borel in G and hence, 


(78) u* : A*—> w(A* AQ) 
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is a measure on G*, Now p is finite for bounded sets and so for any compact 
KoG*, u*(K)=p(K A G)<oo. p* is thus a Borel measure. We claim that 
p* is left invariant. Clearly u* is invariant under left translations by 
elements of G. Suppose now that g* c G* and g, €G such that g, —> g*. 
For feC,(G*) and he G*, we write f*=L,f. Then, for any continuous 
function f* on G* with compact support, f*%) —> f*® uniformly while all 
the functions f*%) vanish outside a fixed compact set. This implies that 


f peona | pond 


and hence 
fodur = f produt. 
G* G’ 


Since this is valid for all f* € C,(G*), it is clear that u* is invariant under 
left translation by g*. In other words, p* is a left Haar measure. p* is 
evidently nonzero. 

Now G is standard and the identity map of G into G* is Borel and one- 
one. Hence G is a Borel set in G*. But then 


p*(G*—G) = p((G*—G) NA G) 
= 0. 


Thus G is a subgroup which is also a Borel set with a complement of 
zero measure in G*. Therefore, the observation we made at the beginning 
of theorem 5.30 is applicable and enables us to conclude that G=G*. In 
other words, G is already locally compact in its Weil topology. 

Combining lemmas 5.32 through 5.40 we obtain the following theorem. 


Theorem 5.41. (Mackey [5]). Let G be a standard Borel group. If G admits 
a left (or right) invariant measure class, then G admits a left or right invariant 
a-finite measure. Moreover, there exists exactly one topology for G which 
converts G into a lesc group and whose corresponding Borel structure is the 
original one on G. 


NOTES ON CHAPTER V 


The measure theoretic study of the orbit spaces of locally compact groups 
has been a very active subject in recent years because of possible applica- 
tions to representation theory, number theory, statistical mechanics, and 
so on. For a survey treating many of these aspects see R. Zimmer, Ergodic 
Theory, group representations, and rigidity, Bull. Amer. Math. Soc., 6 (1982), 
pp. 383-416. 


CHAPTER VI 
SYSTEMS OF IMPRIMITIVITY 


1. DEFINITIONS 


This chapter discusses the notion of a system of imprimitivity for a 
group and, in some cases, describes all such systems to within unitary 
equivalence. These problems were first solved by Frobenius for the case of 
finite groups. For infinite groups their study is more recent. Their first 
appearance seems to be in a series of examples constructed by Murray and 
von Neumann [1], [2] in their theory of rings of operators. Anticipating 
terminology, we can say that the examples of Murray and von Neumann 
form a class of ergodic systems of imprimitivity. On the other hand, the 
first work on transitive systems appeared in 1939. In his great 1939 paper 
Wigner underscored their importance for building up the theory of 
representations of the inhomogeneous Lorentz group and studied, 
although in quite an implicit fashion, the systems of imprimitivity associ- 
ated with this group. Also Gel’fand and Naimark [1] based their solution 
to the problem of describing the unitary representations of the “ax+b”’ 
group on an analysis of certain natural systems of imprimitivity. 

The systematic development of this concept is due to Mackey. In a 
series of papers (cf. Mackey [7] for all references) he obtained the main 
general results of the subject and pointed out the far-reaching scope of the 
ideas involved therein. The variety of applications of these results is quite 
impressive. They enable one, among other things, to obtain the equivalence 
of matrix and wave mechanics; they lead to a concise description and 
classification of the relativistic wave equations; they make possible a 
unified approach to the theory of representations of the commutation 
rules; also, on the purely mathematical side, they lead to the notion of 
induced representations of locally compact groups which has played an 
important role in the theory of group representations; and finally, they 
have very interesting implications in the theory of invariant subspaces and 
factorization of analytic functions (cf. Helson [1}). 

We begin with the notion of a representation. Let G be a lesc group and 
HA a Hilbert space (always separable). A (unitary) representation of Gin 7 
is a homomorphism 
(1) U:g—>U, 
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of G into the group X of all unitary operators on # such that for each 
fe, the map 


(2) g—> U,f 


is continuous from G into # Since Y is second countable in its strong 
topology (lemma 5.34), lemmas 5.28 and 5.4 show that for the homo- 
morphism g —> U, to be a representation it is sufficient that 


(3) g> Uf f> ff e#) 


are measurable on G for all f, f’ € # A representation U in # is said to be 
equivalent to a representation U’ in X” if there exists a unitary isomorphism 


(4) W: H > HX’ 
of H onto #’ such that 
(5) U; = WU,W-? 


for all g e G. The representation U on # is said to be irreducible if the 
only closed linear manifolds of # which are invariant under all the linear 
operators U, are 0 and #. 

If Gis a locally compact abelian group, it is well known that its irreduc- 
ible representations are none other than its characters; i.e., continuous 
homomorphisms of G into the multiplicative group of complex numbers 
of modulus 1. Two characters are equivalent if and only if they are 
identical. The theory of characters was developed in its definitive 
form by Pontrjagin [1]; this development was used by Weil [1] to obtain 
the Plancherel formula for locally compact abelian groups. 

On the other hand, when G is not necessarily abelian, but compact, a 
definitive theory can again be developed. The beginnings of this theory 
can be traced at least to the theory of spherical harmonics. But in its 
modern form, the theory is undoubtedly the creation of Hermann Weyl. 
Using powerful and original (transcendental) methods he obtained the 
representations of arbitrary compact groups, his version of the Plancherel 
formula for such groups (the Peter-Weyl theorem), and rounded off the 
theory with a succinct description of the representations of compact semi- 
simple Lie groups (cf. his papers [3], [4], [5] in Mathematische Zeitschrift 
and his expositions [1] (to an audience of physicists) and [2)). 

The Lorentz group is neither compact nor abelian. The study of its 
representations must therefore be based on the theory of representations 
of lcsc groups which are neither compact nor abelian. This theory is vast 
and quite recent. A survey of some of this recent work is given in Mackey’s 
address [7]. 

Suppose that G is a lesc group. Let X be a standard Borel space. We 
relate G and X by assuming that X is a Borel G-space. Let # be a 
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separable Hilbert space. A system of imprimitivity for G acting in # is a 
pair (U,P), where P(E — Pp) is a projection valued measure on the class 
of Borel subsets of X (the projections being defined in #), and U(g — U,) 
is a representation of G in # such that the relations 


(6) U,P,U,~* = Pox 


are satisfied for all g e G and all Borel sets Ho X. The pair (U,P) is said to 
be based on X. The notions of equivalence and irreducibility are defined 
for systems of imprimitivity in a fashion similar to the corresponding 
notions for representations. Two systems (U,P) (in #) and (U’,P’) 
(in Æ’) based on the same G-space X are equivalent if there exists a 
unitary isomorphism W of # onto #” such that 


U, = WU,W-}, 
P; = WPW- 


for all ge G and all Borel sets Ec X. The system (U,P) is said to be 
irreducible if there is no closed linear manifold of % other than 0 and 
A, which is invariant under all P; and all U,. Note that if U is an irre- 
ducible representation, (U,P) is an irreducible system of imprimitivity, 
but the converse is generally false. If (Ut, Pt) (i=1, 2,---) is a sequence of 
systems of imprimitivity all based on X, with (U',P') acting on #, then 
it is natural to define their direct sum as the system (U,P) acting in % 
where 


(7) 


KH = SK 
and where 
Psi (Ji fatt) > (Pfi Pfa), 
U, : (i fos ° -) —> (U fi» Ufa ° -). 
In symbols: 


(U,P) = @® (U',P"). 
i 

In order to be able to handle the concepts of irreducibility, equivalence, 
and so on, it is convenient to transform these geometric concepts into 
algebraic ones. It is customary to do this with the help of the notion of the 
commuting ring. We work with representations, but all of our remarks 
apply also to systems of imprimitivity. Let U be a representation of G 
in #. The commuting ring of U is the set A of all bounded operators A 
in # such that 


(8) U,A = AU, 


for all g e G. Obviously A is an algebra and, if A e X, then A* e A. More- 
over it is easily seen that 2 is closed under the weak topology. % is thus a 
von Neumann algebra (cf. Section 4, Chapter VII, Volume I). The 
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structure of A determines very completely the geometry of the action of 
U. For example, a closed linear manifold L is invariant under U if and 
only if the projection P* on L belongs to A. In this case, by restricting 
each U, to L we obtain a representation of G in L, called the subrepresenta- 
tion of U defined by L. Thus, every projection in A defines a unique 
subrepresentation of U. This subrepresentation is irreducible if and only 
if the associated projection, say Q, in A, is minimal, i.e., if Q' <Q, Q' is a 
projection and Q’ eX, then Q’=Q or Q’=0. Thus, U is equivalent to a 
direct sum of irreducible representations if and only if 1 is a direct sum of 
minimal projections in X. U itself is irreducible when and only when the 
only projections in % are 0 and 1. Now if any operator A belongs to X, 
all the spectral projections of (A+ A*)/2 and (A —A*)/2i belong to A. 
Thus U is irreducible if and only if A consists of the multiples of the 
identity operator. 

These remarks, which will suffice for our purposes, are the basic ones 
around which a theory of the global geometry of an arbitrary representation 
can be built. The reader, who is interested in this should consult the works 
of Mackey [4], [7], Varadarajan [5], and Loomis [3]. 

Suppose (U,P) is a system of imprimitivity based on X acting in Z. 
For any bounded Borel function f on X (complex valued) let the operator 
A, be defined by 


(9) (Av) = f fla)dvy, (2), 


where v,,, is, as usual, the complex valued measure on X defined by 
Vu (E) = <Pgu,w) (u, v Ee Æ). 


vau is a finite measure for any u € # and v,,„(X)= ||u||?. Moreover, 
Au = È Pde) — (we 2). 


The mapping f — A, from the algebra of all bounded Borel functions into 
the algebra of all bounded operators in # has the properties described 
below; their verifications are easy: 


(i) f— A, is an algebra homomorphism, 
(10) (ii) if fis real, A, is self-adjoint; more generally A,*=A,., 
(ii) J4] < sup |f(@)]. 
rex 
Moreover, using standard arguments, we obtain, from (6), 
(11) U,A,U,-1 = Ags, 


where f° is the function x —> f(g~?-2). 
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An important special case arises when X is a locally compact Hausdorff 
space satisfying the second axiom of countability and the action of G on 
X is continuous. In this case, it is perhaps more natural to replace the 
projection valued measure P by the homomorphism f— A, (fe¢C,(X)). 
We have: 


Lemma 6.1. Let f — A, be a homomorphism of the algebra C,(X) into the 
algebra of all bounded operators on a separable Hilbert space # satisfying 
the relations (10) and such that the linear manifolds spanned by the ranges of 
all the operators A, is dense in I. Suppose further that g —> U, is a unitary 
representation of Gin # such that (U,A) satisfies (11). Then, there exists 
exactly one system of imprimitivity (U,P) based on X such that, for each 
feC(X), A, is related to P by (9). 


Proof. For any f in C,(X) which is real and >0, we observe that 
A,>0; in fact, f=/f,?, where fı is also real and >0 and 4,=4A,,?20. 
Consequently, the map f —> ¢A,u,u> defines a Borel measure v, „ on X. If 
K is any compact set and f is an element of C,(X) such that 0<f<1 and 
f=1 on K, the inequality (iii) of (10) shows that 

YuulK) < |jull?. 
vy, is thus a finite measure and v, ,(X)<||wl|?. At the same time, if 
u,v E€ # there exists a unique complex valued measure v, , such that 


(12) <A u, = f fdv» (fECAX)). 


For any Borel set E, the map u, v —> v,,,(Z) is *-bilinear and 0 < v, ,(Z) < 
ul? for all u e #. Therefore, there exists a self-adjoint operator Pp 
with 0< Pp <1 such that 
(13) vu (E) = (Pzu,v> 
for all u, v e€ Z. 

We want to show that E —> P, is a projection valued measure. Suppose 


K is a compact set and {f,} is a sequence of real nonnegative functions in 
C,(X) such that fa | xx pointwise. Then (12) shows that 


(14) <A, u,v) —> <P pu) 


for all u, v e #; i.e., A; converges weakly to Pg (> will denote weak 
convergence). If f is any element of C,(X), A; A;>P,A, and 4,4, => 
A,P, so that Pg commutes with all A,. Moreover, if f>0, then ff, | fxr 
and hence 


CAs, Aru) = f fata,» 


— Í fdvy,y. 
K 
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In other words, if f e C.(X) is real and >0, 


(15) (P,A,u,v) = | favan. 


If we choose another compact set L and a sequence {fa} of real non- 
negative elements of C,(X) such that f,’ | v,, we obtain, from (15) and the 
convergence A, > P,, the equation 


(P,u,P ev = (Proart). 


Now, for fixed K, E —><Ppgu,Pgvy and E —> <Pengu, wy are complex 
valued measures which coincide for all compact sets L. Hence they co- 
incide for all E. For arbitrary but fixed E, we now use the resulting 
coincidence for all K, and conclude that for all Borel sets E and F, 


(Pru, Prvy = <Penru wy 


for all u, ve #, i.e., Peapr=PgPr. For E=F this shows that P, is a 
projection. The map # —> P, has now all the properties of a projection 
valued measure except perhaps that Px may not be 1. But P,=1; for, if 
P,#1, there will be a nonzero ue & such that P,yu=0. This would 
imply that v, ,=0 and hence that A,*u=0 for all feC,(X). But then u 
would be orthogonal to the ranges of all the A,, a contradiction. Finally, 
by a routine transport of structure, the equation 


U,A,U,7) = Avs) 
is seen to lead to the equation 

U, PU} = Por. 
This proves the lemma. 


We shall now give some examples of systems of imprimitivity. X and G 
have the usual meanings, i.e., X is a standard Borel G-space, G is lesc. 


Example 1: Let a be a o-finite invariant measure on X, W = £?(X,a); U, 
and P, are defined by 


Prf = xef, 
(U,f)(x) = f(g x). 
The identities U,,,,=U,,U,,, U.-=1 as well as the relations (6) are the 
results of routine calculations. Further, the invariance of a implies that 
U, is unitary. For f, f'€ # the map g, x — f(g7!-x)f'(x)* is Borel on 
Gx X and hence the map 


(16) 


g><U, ff = f f(g-+-x)f'(x)*da(a) 
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is a Borel map. U is thus a representation and (U,P) a system of 
imprimitivity. 

Example 2: Our second example is a variation on the theme of the 
first one. Let & be a separable Hilbert space. We then define, for any 
o-finite measure «œ on X, #=$7(X,4,a), to be the vector space of a- 
equivalence classes of Borel maps f of X into X such that 


(17) isl? = f æde) < oo: 
here |- | denotes the norm in % If we define, for f, f’ E€ Æ, 
(18) FP = | SOS edal), 


where <.,.> denotes the inner product in 4% then # becomes a Hilbert 
space under <.:.>. If X is a separable Borel space, # is separable. 

We shall now select an arbitrary invariant measure class on X and a 
c-finite measure « in it. a need not be invariant. For each g e G we select 
a positive Borel function r, which is a version of da/da *). We then define, 
for fE X 


Uf = (Y, 
1.e., 
(20) (Uaf )(x) = {ra(g9 7 -f(g 2). 


The quasi invariance of «œ implies that everything in sight is well defined. 
The identities (20) of Chapter V, which are atisfied by the 7,, now lead 
to the conclusion that g —> U, is a homomorphism of Œ and that (U,P) 
satisfies (6). The calculations are straightforward. Moreover, as each U, is 
invertible and 


[USI = | CIP) 


= | rolf) daoa) 
= |Z], 


it follows that each U, is a unitary operator. In order to assure ourselves 
that (U,P) is a system of imprimitivity, it remains only to check that for 
each f, f eÆ, g—><U,f: fò is a measurable function on G. Now, by 
theorem 5.10, there exists a positive Borel function r on G x X such that 
for almost all g, r,(z)=r(g,x) for a-almost all x. Therefore, for almost 
all g, 


(Uf sf) = f, fr(g.g- «Cf (g-4-2), f'(@ydala). 
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The right side is a Borel function of g since the integrand is Borel on G x X, 
so that g > (U,f : f> is a measurable function. But, as noted above, this 
is sufficient to assure us that U is a representation. 

When X is the phase space of a classical mechanical system, G' = R}, and 
the action of G on X is given by the dynamical group of the system, the 
representation U in example 1 was considered by Koopman [1] (« is the 
Liouville measure). Prompted by this, we shall call the system of example 2 
(of which the system in example 1 is a special case) a Koopman system of 
umprimitivity. 

In this example, the fact that (U,P) satisfies (6) depends decisively on 
the identities satisfied by the r,. It is natural to suspect that one may be 
able to construct other systems of imprimitivity, by using more general 
cocycles. This is substantially the case. We shall develop this idea in the 
following sections. 


2. HILBERT SPACES OF VECTOR VALUED FUNCTIONS 


The Hilbert space in which the system of imprimitivity of example 2 
of Section 1 acts is a space of #-valued functions. This situation is very 
typical. We devote the present section to a detailed study of certain 
algebras of operators in such Hilbert spaces. 

Let # be a separable Hilbert space. We use the notation |-| to denote 
the norm in #% and <.,.> to denote the inner product in X. For any 
operator A of & we write |A] for its norm. Let S be the set of all bounded 
operators in X. The unitary group M of X is a subset of S. We shall con- 
sider S as a Borel space by giving it the smallest Borel structure which 
makes all the maps A —> <Au,v> (u, v e X) Borel. When we speak of Borel 
maps from or into £, this is the Borel structure we have in mind even when 
it is not made explicit. 


Lemma 6.2. S, equipped with the above Borel structure, becomes a standard 
Borel space and M is a Borel subset of tt. The maps A, B—> AB and 
A,B—+>A+B are Borel from SxS into S, A, u —> Au is Borel from 
Sx X into X, while A —> A* is a Borel automorphism of S. The induced 
Borel structure on M coincides with the Borel structure associated with the 
strong topology on M. 


Proof. Choose an orthonormal basis {e,} in X. Let T be the space of all 
complex matrices (a). We may identify T canonically with a product of 
copies of the complex plane. T thus becomes a standard Borel space. 
For A e S, we define 


(21) a,(A) = (Ae,,e,>. 
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Obviously, the Borel structure on S is the smallest one which makes all 
the maps a,,(A —a,,(A)) Borel. Thus, if t(4) denotes the element 
(a,,(A)) of T, (A — t(A)) is a one-one Borel map of S onto a set, say S’ 
contained in T, and the Borel structure on S is that one for which ¢ is a 
Borel isomorphism with the Borel subspace of T defined by S’. To prove 
that S is standard, it is sufficient to prove that S’ is a Borel subset of T. 
Only the case dim % = œ is of interest. Now, if (a) is an element of T, 
it is easily seen to belong to S’ if and only if there is an integer L such that 
for any integer N, and any N complex numbers ¢,..., cy with rational 
real and imaginary parts, 


2 


i 


N 


> Qj 5C; 


j= 


2 


< L(|c,|?+---+|ey|?). 


Since the conditions involved are only denumerably many and since 
each condition defines a Borel subset of T, it follows that S’ is a Borel set. 
This proves that S is standard. 


The equations 
a,(AB) = > (Aar; (B), 
k 


a;(4*) = a,(A)*, 
a, (A+B) = a,,(A)+a,,(B), 
<Au,e,» = > LU,e a(d), 
j 


tell us immediately that A, u— Au, A, B — AB, and A — A* are Borel 
maps with appropriate domains and ranges. Since 


M = {4A : Aes, AA* = A*A = J}, 


it follows from these results that M is a Borel set in S. Finally, the induced 
Borel structure on M is the smallest one which makes all the maps 
A — Au of M into X (ue X) Borel, and is thus clearly the Borel 
structure associated with the strong topology on M. 


Corollary 6.3. Suppose Z is a Borel space and z —> A(z), z > B(z), and 
z — u(z) are Borel mappings of Z into S, S, and & respectively. Then 
z —> A*(z) and z > A(z)B(z) are Borel maps of Z into S and z —> A(z)u(z) 
1s a Borel map of Z into X. 


Consider now a standard Borel space X, a o-finite measure a on X and 
the Hilbert space # = £?(X,%a). For each Borel set ES X, 


(22) Prif>xef (JEH) 
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is a projection operator, and P(E —> Peg) is a projection valued measure. 
The main concern of this section is with the algebra of bounded operators 
of # which commute with all Pg. We consider bounded Borel maps b: 


(23) b : x — W(x), C = sup |b(x)| < œ 
xex 


of X into S. If f is any Borel function on X with values in Æ such that 
f, I/(@)|?da(z)< 0, then x — b(x)f (x) is also Borel and 


f BASA dola) <0 f fdata 


Thus there exists a unique bounded operator, denoted by b~, such that 
for each f e # for «-almost all x, 


(24) (6~f)(x) = b(x)f (x). 


b~ is an operator in % its norm ||b~ || <C, and it follows from (24) that b~ 
commutes with P, i.e., for all £, 


(25) b~ P, = P,b~. 


It is obvious that if b, is another bounded Borel map of X into S such that 
b (x)=b(x) for «-almost all z, 6,~=b~. We regard the collection of 
bounded Borel maps from X into S as an algebra with involution by 
defining 


(aibi +@qbo)(%) = a,b (£) + a562(z) (a;,@, complex numbers) 
(26) (bc)(x) = b(x)c(x), 
b*(x) = b(x)*, 
for all xe X. 


Lemma 6.4. For any bounded Borel map b of X into S, b~ is a bounded 
operator in A commuting with P. Conversely, if B is an operator in # 
which commutes with P, there exists a bounded Borel map b of X into S such 
that b~ = B. b,~ =b,~ if and only if b,(x)=6,(x) for a-almost all x. Finally, 
(biba) ~ =b,~b3~, (b1*)~ =(b,~)*. In particular, b~ is a unitary operator 
(projection, normal, etc.) if and only if b(x) is unitary (projection, normal, 
etc.) in X for «-almost all x. 


Proof. We assume that a(X)<0oo; the general case is obtained by an 
obvious patching up over sets of finite measure which we leave to the 
reader. For any u e X we write, for the purpose of this proof, u for the 
constant function xz —> u of X into X; ue # as a(X)<oo. 

We observe that if B, and B, are two bounded operators of # which 
commute with P, and if B,u= B,u for all u e % then B, = B. In fact, for 
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any Borel set ESX, B,(y,u)=B,(y,u). From this it follows that 
B,f=B,f whenever f is a Borel function on X (with values in %) taking 
only finitely many values. The general case now follows from the fact that 
such functions are dense in #. Thus B, = B3. 

This done, let B be a bounded operator in # such that B commutes 
with P. We may assume that | Bl <1. Let {e,} be an orthonormal basis 
for X. Let b, be a Borel map of X into X such that 6,=Be,. Now 
|B| <l; so, for any N complex numbers ¢,,---,¢cy, as B commutes 
with P, 


| BPa(cye, + ++» +¢yey) |? = ||xelc1 Be, + - + - +¢y-Bey)|]? 
< |lxe(¢re1 +--+ +¢yew)||?, 
1.€., 
lye(cy Be, +--+ +¢yBey)||? < (\e,|?-+ +++ + |ey|2)oe(Z) 
for all Ẹ, i.e., 


f logby() + + - - + eyby(ae)|2dee(x) < (Je1|2+ ++ + fey|)ee(Z) 


for all E. For fixed N and ¢,,---, cy, these inequalities (for all £) imply 
that 


le,bi (a) + +++ +eyby(x)|? < fey]? +--+ + fey? 


for «-almost all x. By varying N and the c’s over complex numbers with 
rational real and imaginary parts, we can assure ourselves that for some 
a-null set Z we have the inequalities 


(27) [cb (x) + vee +Cyby(x)|? < le,|? + + len|? 


valid for all x e X — Z, all integers N, and all sets (c,,---, cy) of N-tuples 
of complex numbers with rational real and imaginary parts. From (27) 
we conclude that for each x e X — Z there exists an operator b(x) in X with 
|b(x)| <1 such that 


(28) b,(x) = b(x)e, 


for all r. We define b(x)=0 for x e Z. Equation (28) shows at once that 
x —> b(x) is a Borel map from X into S such that |b(x)| <1 for all x and that 
for the operator b~ 


(29) b~e, = Be, 


for all r. From (29) and from the observation made at the beginning of the 
proof, we deduce that B=b~. To prove that b is unique up to null sets, 
let b, be such that 6, ~ =b~. Then for each r, b,(x)e,=6(x)e, for a-almost all 
x. This shows at once that b (x)=b(x) for a-almost all x. 
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Routine calculations show that (bc)~ =b~c~, (b+c)~=b~ +c~, and 
(b*)~ =b~*. If b(x) is unitary for almost all z, then it is obvious that b~ 
is unitary. Conversely, if b~b~*=6b~*b~ =1, then b(x)b(x)* = b(x)*b(xz) =1 
for almost all x, by the essential one-one nature of the map b —> b~. Thus 
b(x) is unitary for almost all x. The assertions involving projection and 
normal operators are proved similarly. The proof of the lemma is complete. 


Our next lemma deals with families of operators commuting with P. 
Lemma 6.5. With notation above, let Y be a standard Borel space and 
(30) b:y, x —> 0(y,2) 


a bounded Borel map of Y x X into S. For each y e Y, let b(y,-) denote the 
map x —> b(y,x) of X into S. Then the map y —> b(y,-)~ ts a bounded Borel 
map of Y into the Borel space of all bounded operators of #. Conversely, 
if y — B, ts a bounded Borel map of Y into the Borel space of all bounded 
operators in H such that B, commutes with P for all y, and if v ts a o-finite 
measure on Y, there exists a bounded Borel map b of Y x X into S such that 


(31) B, = b(y,-)~ 

for v-almost all y. b is determined uniquely (v x «)-almost everywhere. I f each 

B, is unitary, we can arrange matters so that b( y,x) is unitary for all y, x. 
Proof. We may clearly assume that « and v are finite. Let b be a bounded 


Borel map of Y x X into S, say |b(y,x7)| <C, and let B,=b(y,-)~. If f and 
fi are in Æ, 


<B,f: f> = f blye) f(E), f’ (@)ydal2), 


which is Borel in y as the integrand is Borel in y, x. We also have 
|| B,|| <C. Conversely, let y > B, be a bounded Borel map of Y into the 
Borel space of all bounded operators of # and let B, commute with P 
for all y. We may assume that || B,|| <1 for all y. For each y e Y we choose 
a Borel map b, of X into S such that |b (x)| < 1 for all x and B,=b,~. Let 
{e,} be an orthonormal basis for # and for any u e X let us write u for 
the constant map x —> u on X. Then, for ESX, 


(Biys@m :€,> = f b (22) s€,)da(2), 
so that for each Borel set ESX, 
y —> f Cb (2)em,e, dala) 
E 


is a Borel function of y. Lemma 5.5 is now applicable and hence we can 
deduce the existence of complex-valued Borel functions r,,, on Y x X 
such that for each m and n, for v-almost all y, 


(32) (b,(2)ems€n» = Tn,m( Y2) 
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for a-almost all x. There is a set NE Y of v-measure zero such that for each 
ye Y—N and all m, n, (32) holds for a-almost all x. Since |b (x)| <1, we 
deduce from (32) and the Fubini theorem that for (v x «)-almost all y, x, 


(33) >, ltam(¥s2)|? < 00 


for each m, and 


(34) > 


n 


k 


> Tn, m( Y,L)Cry 


m=1 


2 
< foal? +--+ + exl? 


for all k and all k-tuples of complex numbers (c,,---, Ck) whose real and 
imaginary parts are rational. Let Zg Y x X be the Borel set of all (y,x) 
such that (33) and (34) are satisfied for the indicated ranges of m, k, c. 
The complement of Z is obviously v x a-null. Let rž m be defined by 


nan a) e Z, 
(35) rin Yt) = fo a H ne z 


Then the rž „ are Borel functions and the inequalities (33) and (34) are 
satisfied by the rx „ for all y, x. Hence for each y, x there exists an operator 
b(y,x) such that |b(y,x)| <1 and 


(36) D(Y,£)emEn? = T ml Y2). 


Equation (36) shows that (y,x) —> b(y,x) is a Borel map of Y x X into NS. 
Now, fam=T* m (vxa)-almost everywhere on Y xX, so that we easily 
deduce from (32) that for v-almost all y, 


by (L)emln» = DYT) em En? 
for a-almost all x, i.e., for v-almost all y, 
B, = B(y,-)~. 


If b’ is another bounded Borel map of Y x X into S satisfying (31) for 
v-almost all y, it follows from lemma 6.4 and Fubini’s theorem that 
b( y,x) =b'(y,x) for (v x «)-almost all y, x. For the last statement, note that 
if B, is unitary for all y, Z={(y,x) : b(y,x) is unitary} is a Borel set whose 
complement is of (v x «)-measure zero. All we have to do is to redefine 
b(y,x) to be 1 for (y,z) ¢ Z. The proof of the lemma is complete. 


3. FROM COCYCLES TO SYSTEMS OF IMPRIMITIVITY 


We remarked in our example 2 of Section 1 on the connection between 
the cocycle identities (35) of Chapter V and the identities (6). The main 
aim of this section is to show that associated with a cocycle taking values 
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in a unitary group, there is a system of imprimitivity, and that this system 
is determined up to equivalence by the cohomology class of the cocycle. 
This is precisely the reason for our detailed study of cocycles in the previous 
chapter. We begin with an elementary lemma. 


Lemma 6.6. Let G be a lesc group and # a separable Hilbert space. 
Suppose that L(g — L,) is a mapping of G into the set of bounded operators 
in H such that (i) for all f, f E€ Æ, g — <L, f, f> is a measurable function, 
(ii) DL, is unitary for almost all g, and (iii) Lig, = Lo, Lo, for almost all 
(91,92) €GxG. Then there exists exactly one representation U of Gin # 
such that L, =U, for almost all g. 


Proof. Let u be a left Haar measure on G and #1 the Banach space of 
„-summable functions. For f}, fz € Z> we introduce their convolution 


(fi *fa)(g) = f falt)folt-*g)dp(t). 


fı * f2 is almost everywhere finite and defines an element of Z1. Define 
the operator L,, for fe Y', by setting, for u, v e XH 


(37) (Luo) = f flg)<L,u,v>du(g). 


f— L; is linear and |Z, <|- A routine use of the Fubini theorem, 
coupled with (iii) of the lemma, gives us the equation L,,.,;,= L,, Lja 
Moreover, the sum of the ranges of the L, is dense in #; for, if v e # and 


f Foor<tauo>duig) = 0 


for all u e # and fe Zt, then, for each u e Æ, (L,u,v> =0 for almost all 
g. Taking a countable dense set of u’s, we obtain the equation L,*v=0 
for almost all g. Hence v=0. 

We now apply a known theorem (Loomis [2], p. 128) and obtain a 
unique homomorphism g —> U, of G into the group of invertible operators 
in # such that ||U,|| <1 for each g, g — U, is Borel, and for each fe 21, 


f f(g)U,u»>dulg) = L, (ve). 


This implies rather easily that U,= L, for almost all g. U, is therefore 
unitary for almost all g. Now, the set of g, for which U, is unitary, is a 
subgroup. Hence U, is unitary for all g, i.e., U is a representation. The 
uniqueness of U is obvious. 

Let X be a standard Borel G-space. We choose an invariant measure 
class @ on X and fix it throughout the rest of this section. Let «œ be a 
measure in this class. We introduce a separable Hilbert space “ and 
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denote its unitary group by M. Let X = #?(X,%a). By a cocycle we mean 
a (G,X,M)-cocycle relative to a, i.e., a Borel map (9,7) > o(g,x) of G x X 
into M such that the identities (35) of Chapter V are satisfied. The null 
sets on X are those of €, and on G the usual ones. 


Theorem 6.7. Let the notation be as above. For each géG, let r, be a positive 
Borel function which is a version of da/da9—), Suppose that y is a (G,X,M)- 
cocycle relative to a. Then, there exists a unique system of imprimitivity (U,P) 
such that 


(38) Paf = xef, 
and for almost all g, 
(39) (U fE) = {rolg x) plg g- x) f(g~* +x) 


for each fe # for almost all x. Moreover, the equivalence class of (U,P) 
depends only on € and yy, where Ye ts the cohomology class of ¢. 


Proof. Equation (38) defines P, and P(E —> Pp) is a projection valued 
measure in #. Let us now define, for each ge G and fe #, the Borel 
function L,f on X by 


(40) (Lof IE) = {r9(g7t-2)}9lg.g7*-2)f (ga). 


Since p takes values in M, we have: 


(Lo f)(x)|? = rag -x)| f (97 -2)|?, 
from which we conclude that L,fe and ||L,f||=||f||. L, is thus an 


isometry. It is actually unitary because its range is the whole of # In 
fact, iff’ e # and <L, f : f’>=0 for all f e 4 a brief calculation shows that 
o(g,x)*-f'(g-x)=0 for almost all x which gives us the relation f’=0. Next, 
p being a cocycle, 


PGi1G2%) = P9192: £)P(J2:x) 


for almost all (g1,g92:x£) E Gx Gx X. From this we conclude by a routine 
calculation that L,,,=L,,L,, for almost all g,,g2 (for the r, we use 
identities (20) of Chapter V). We now verify that g>L, is a measur- 
able map. We choose a positive Borel function r on G x X such that, for 
almost all g, r(g,x) =1,(x) for almost all x. Then, for f, f’ € 7 


CLF f> = f {r(g,g ~>- x) plg g x) f(g * +2), f"(x)>de(21) 


for almost all g. Since the integrand on the right is.a Borel function on 
Gx X, the integral is a Borel function on G and hence g > <L, f : f’> isa 
measurable function on G. We are therefore in a position to apply lemma 
6.6. Thus there exists a unique representation U of G in # such that 
U,= L, for almost all g. 
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We calculate from (40) that for each ge G, L,P;=P,.,L, for all Borel 
sets Ec X. Therefore, if N is a Borel set of measure zero such that U, = L, 
for ge G—N, then for each ge G—N, 


(41) U,P,U,~* = P;E 


for all Borel sets E. Now, as U is a representation, the set of all g e G for 
which (41) holds for all Borel Æ is a subgroup of G. As the complement of 
this subgroup has measure zero, we conclude that (41) must be valid for 
all ge G and all Borel sets E. (U,P) is thus a system of imprimitivity 
based on X. Its uniqueness is obvious. 

To show that the equivalence class of (U,P) depends only on the 
measure class of « and the cohomology class of », we proceed in two steps. 
First, we replace « by a’ in the same measure class, keeping ¢ fixed. In 
this case, let a>0 be a Borel function such that a=dea'/da. Let (U’,P’) 
be the system of imprimitivity associated with a’ and p. Then 


(42) W : fa tf 


is a unitary isomorphism of L?(X, Xa) onto #?2(X,%a') and the formula 
(39) shows that 


Pg = WP,;W7?} 
for all E, and for almost all g, 
U; = WU,W-?. 


Once again, as the set of all g such that U,’=WU,W—' is a subgroup, 
these equations are valid for all g. Thus (U,P) and (U’,P’) are equivalent. 

It remains to examine what happens when we fix « but change øp. Let 
g’ be a cocycle defining the same cohomology class as g. Let (U’,P’) be the 
system of imprimitivity associated with a, g’. Both (U’,P’) and (U,P) 
act in # =L? (X, Xa). Since p'~ g, there exists a Borel map b, 


(43) b : x — D(x) 
of X into M such that 

p'(g,x) = b(g-x)p(g,x)b(x) ~* 
for almost all (gx) e Gx X. For any fe Æ, 


f(x)? = [o(z) f(a)/?, 


and hence there exists a unitary operator B of L?(X, Xa) such that 
(cf. Section 2) 


(44) B= b~. 
From (44) we obtain 
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for all Borel sets £. Next we compute BU,B~1. We have, from (39), for 
almost all ge G, 


(BU,B-*f)(x) = {r,(g~+-x)}"!*b(x)p(g,.g~ *-x)b(g~*+x)-*f(g~?-2) 
= {r,(g 7+- x) p (gg * +x) f(g~* +x) 


= (U, f)(x) 
for each f e # for almost all x. Therefore 
(46) U; = BU,B- 


for almost all g. Using our usual arguments, we conclude that (46) is 
valid for all g. Equations (46) and (45) show that (U,P) and (U’,P’) are 
equivalent. This completes the proof of the theorem. 


Corollary 6.8. If p is a strict cocycle or even tf we assume that for each 
(91:92) E Gx G, 


P9922) = 9(91,92°X)P(G2,X) 
for almost all x, then for each g € G and each f 


(Ug f(x) = {rolg x) plg g +2) f(g *-2) 
for almost all x. 


In this case, g —> L, itself is a homomorphism so that L,=U, for all g. 
Given g, «œ as above, the system of imprimitivity (U,P) constructed in 
theorem 6.7 is said to be associated with and a. 


4. PROJECTION VALUED MEASURES 


In this book we have come across projection valued measures from 
time to time, but such uses as we have made of them have been quite 
superficial. The aim of this section is to describe some of the deeper aspects 
of the theory of projection valued measures. These results are all known 
and form the Hahn—Hellinger multiplicity theory. We recommend the 
reader to the accounts of Stone [2], Nakano [1], and Halmos [2]. We shall 
confine ourselves to a concise description of the main features of this theory. 

The main problem in this theory is that of determining when two 
projection valued measures are equivalent. More precisely, let X be a 
standard Borel space, #' a separable Hilbert space, and P' a projection 
valued measure in #! based on X. We say Pt and P? are equivalent, 
P~ P? in symbols, when there is a unitary isomorphism W of #! onto 
H? such that Pg? = WPW! for all E. The main problem is, given X, 
to construct canonical forms for the projection valued measures based 
on X. 


We begin by choosing a o-finite measure a on X and a separable Hilbert 
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space X. Let #=L£?(X,4%a) and P be the projection valued measure 
E -» Pp, where 


(47) Pef = xef (JEH) 


We write P=P(%a). The first result of the theory is that P(#a)~ 
P(X’ ,a') if and only if (i) «@ and a’ define the same measure class, and 
(ii) dim #=dim X”. If these conditions are satisfied, if X = X, and if a 
is a positive Borel function which is a version of da’ /da, the map 


W :f—>a- tf 


is a unitary isomorphism of #7(X,4%a) onto #7(X,%a') such that 
P(X a) =W P(X aW. 

This result motivates us to introduce the following definition. A 
projection valued measure P (acting in a separable Hilbert space) is said 
to be homogeneous if it is equivalent to P(%Xa) for suitable X and «. 
The definition (47) of P(#,«) shows that for a Borel set ES X, a(H)=0 
if and only if P;=0. The measure class of a is thus determined by P in a 
very direct fashion. It is called the measure class of P. The dimension of X 
is said to be the multiplicity of P. Given any measure class € on X and 
any integer n (1 <n <œ), there exists, up to equivalence, a unique homo- 
geneous projection valued measure P with @ as the measure class of P 
and n its multiplicity. 

On the other hand, given a projection valued measure P on X, it is a 
deeper matter to determine when it is homogeneous. A vector u € X is 
said to be cyclic for P if the collection {P pu} spans a dense linear manifold 
of #. It can be shown that if there is a cyclic vector for P, P is homo- 
geneous and of multiplicity one and that, conversely, homogeneous 
projection valued measures of multiplicity one do have this geometric 
property, i.e., existence of a cyclic vector. The case of arbitrary multiplicity 
can be handled along similar, but somewhat more complicated, lines. 

Now we pass on to the general case. For each n=, 1, 2,--- we choose 
a separable Hilbert space %, of dimension n and fix it throughout the 
discussion. Choose a sequence of o-finite measures «, @, %,-:+ on X. 
For any o-finite measure « on X let us write Hr ,~=L?(X,%,,0) and 
Pr «= P(X). Suppose that the a, are so chosen that for m#n, a, and a, 
are mutually singular.t Let 


(48) KH = D Haan 


be the direct sum of the #,,,, and for Ec X let P, be defined by 
(49) Ps : (feos Sis fas’ ° -) —> (PE efo PE fi ` ) 


+ o-finite measures a and B on X are called mutually singular if we can write 
X =A U B, where A and B are Borel sets with A A B= ø, B(A)=a(B)=0. 
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(when a, =0 for some n, #, ,, =0). Then P is a projection valued measure. 
We write P = P({. 45}, {«,}). The main theorem of spectral multiplicity theory 
asserts that every projection valued measure on X in a separable Hilbert 
space is equivalent to P({%,}, {«,}) for a suitable sequence {a,}. If {æn} 
and {a,’} are two sequences of mutually singular measures on X, the 
theorem further asserts that 


PAD}, (en) X PLA} ton'}) 


if and only if for each n, a, and a,’ are in the same measure class. 

Given P, we therefore have for each n=oo,1,2,--- a canonically 
determined measure class, say @,. @, is called the measure class correspond- 
ing to the multiplicity n. For a given n, @, may be 0. P is homogeneous if 
and only if €, =0 for all n except a single value no. The construction of the 
Ca from P is delicate. We do not need this for our purposes. On the other 
hand, consider the set of all o-finite measures a with the property that 
a(#)=0 if and only if E is -null for all n (l<n<oo). There exist such 
measures a; in fact, if we choose for each n a finite a, in @, such that 
Èn G(X) <0, then «=>, a, would have the required property. These 
measures a form a measure class, say @. It is obvious that a set F is ¢-null 
if and only if P;=0. @ is called the measure class of P. 


5. FROM SYSTEMS OF IMPRIMITIVITY TO COCYCLES 


We now examine the problem converse to that studied in Section 3. 
We resume our concern with a standard Borel G-space X and a system 
of imprimitivity (U,P) based on X, acting in %. These will be fixed 
throughout this section. The main result is that if P is homogeneous, one 
can construct a cocycle » relative to the measure class @ of P such that 
(U,P) is equivalent to the system of imprimitivity associated with & and 
p. We shall use the notations and results described in Section 4 concerning 
projection valued measures. In particular, for each n(oo, 1, 2,---), Misa 
fixed separable Hilbert space of dimension n. We write M, for the unitary 
group of %,. The norm and inner product in %, are denoted by |.| and 
<.5->, While the norm and inner product in # and Hha (a a o-finite 
measure on X) are denoted by ||. || and ¢.:.). 


Lemma 6.9. With the notation described above, the measure classes 6, 
corresponding to the various multiplicities n are all invariant under the 
action of G. In particular, the measure class of P is invariant. 


Proof. Choose measures æ, €G, Then, P~ P({%;}, {an}. We must 
show that each @, is invariant under G. Let g e @ be arbitrary. Let us 
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write Q= P,.;. Then Q(# —> Qpr) is a projection valued measure in 7% 
The equation 
U, PU, = P£ 


shows that Q~ P. But it is obvious that Q~ P({4,},{a,%}) (via the map 
f— f°) and that the «,’ are mutually singular. Hence for each n, a, and 
æ? define the same measure class. This proves the lemma. 


We recall that a measure class @ on X is said to be ergodic if any Borel 
function f, with the property that f°=f @-almost everywhere for each 
g € G, is constant @-almost everywhere. 


Lemma 6.10. If the measure class € of P is ergodic, then P is homogeneous. 
If (U,P) is irreducible, € is ergodic and so P is homogeneous. 


Proof. Choose mutually singular finite measures a, (n=, 1, 2,---) 
such that we then have P~ P({.%,},{a,}). Let €, be the measure class 
defined by a,. Since the «,, are mutually singular, we can find mutually 
disjoint Borel sets E, (n=oo0,1,2,---) such that X=\)];i snz En and 
a,(X — E,)=0 for all n. Suppose that the measure class @ of P is ergodic, 
but that for two distinct values of n, @,,40. Let a, be distinct real numbers 
and let f be the Borel function which takes the value a, on #,. For each n, 
f is a constant @,-almost everywhere. Since @, 40 implies a,(H,)40, En 
is not @-null for any n for which @, is nonzero. Thus f is not a constant 
@-almost everywhere. But (by lemma 6.9) as each @, is invariant, 
f%=f, @-almost everywhere for each g e Œ. This is a contradiction. We 
must therefore have @, =0 for all but a single value of n, i.e., P is homo- 
geneous. The second statement would follow if we show that @ is ergodic 
whenever (U,P) is irreducible. Suppose now that (U,P) is irreducible but 
@ is not ergodic. Then there exists a real Borel function f such that f is not 
a constant @-almost everywhere but, for each ge G, f%=f, @-almost 
everywhere. Let c be a real number such that, if Eo ={x : f(x) <c}, then 
neither E, nor X — E, is @-null. If g e G, the fact that f(g~!-x)=f(x)= 
f(g-x) for @-almost all x implies that the set (g-H)>—E ) U (Eo —g: Eo) is 
@-null. Hence P .£,= Pr, Since this is true for all g, (6) shows that Pe, 
commutes with all U,. Since P, obviously commutes with all Pp, the 
irreducibility of (U,P) implies that either Pg, or Py_,, is 0, i.e., either Eo 
or X — E, is null, a contradiction. 


Theorem 6.11. Let (U,P) be a system of imprimitivity acting in Æ such 
that P is homogeneous and of multiplicity n (l<n<oo). Let «œ be a o-finite 
measure in the measure class Cp of P. Then (U,P) is equivalent to a system 
(U',P™*) acting in Hna Further, for each cohomology class y of (G,X,M,)- 
cocycles relative to Gp, let X(y) denote the equivalence class of systems of 
imprimitivity acting in Hn a which are associated to y and 6p. Then 


(50) y => Xl) 
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is a one-one correspondence between the set of all cohomology classes of 
(G,X,M,)-cocycles relative to Gp and the set of all equivalence classes of 
systems of imprimitivity of the form (U',P™*). 


Proof. There exists a unitary isomorphism W of # onto #,,, such that 
Pre = WP,W-? 


for all Borel sets Æ. We set U, = WU,W-+. Then (U,P)~(U',P"*). 
Consider now a system (U',P"*). Let us choose, for each g in G, a 
positive Borel function r, such that r, is a version of da/da%~”. For 
simplicity of notation we write HW =n. P’=P”*. Let (V,P’) be the 
Koopman system of imprimitivity, i.e., for each g € G and fe H”, 


(51) (Vif (x) = {rigt — f(g7*-2). 
Write 
(52) W, = V,~1U,’. 


Since both (U’,P’) and (V,P’) are systems of imprimitivity, it follows that 
W, commutes with P, for all Æ. Further, g > W, is a Borel map of G 
into the unitary group of #’. We now use lemmas 6.4 and 6.5. We select, 
for each g € G, a Borel map w, of X onto M, such that 


(53) W, = w”, 

and a Borel map q¢’ of Gx X into M,, 

(54) p :9, £ —> p (9,2), 
such that for almost all g, 

(55) W, = p'(g,:)~. 

For 91, g2 E G, we obtain from (52), 

(56) Wooa = (Voa Wa, Voa) Woz 


A quick calculation shows that 
(57) Voa Wo Vo, = (we?) ~, 
where, as usual, for x e xX, 
w827 (x) = Wy, (g2: 2). 
From (56) and (57) and lemma 6.4, it follows that for each (9,,9,) eG x G, 
(58) Wagal T) = Wy, (92 T)Wg (1) 
for almost all x. Then 


(59) P' (91928) = p'(J192' 2p (J22) 
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for almost all (9,,g2,7)€GxGx X. This proves that q’ is a (G,X,M,)- 
cocycle relative to a. Moreover, as U,’=V,W,, we have, for almost all 
g, U,'=V,¢'(g,-)~, i.e., for almost all g, 


(60) (Ug f(x) = {rolg x) pgg +x) f(g7* +a) 


for fe #’. In other words, (U’,P"*) is the system of imprimitivity 
associated with g’ and «a. 

If p,’ is another cocycle satisfying (60), then, for almost all g, »,'(g,-)~ = 
W,=w,~. This shows that g,'= 9’ almost everywhere on Gx X. 

To complete the proof of the theorem, we have only to show that if 
(U",P’) is a second system of imprimitivity equivalent to (U',P’) and if 
œ” is a cocycle corresponding to (U”,P’), then gp’ and »” determine the 
same cohomology class. Let W be a unitary automorphism of #’ such 
that 

WP,W-1 = Pr, 
WU,"W-) = U,’, 


(61) 


for all g, E. There exists (lemma 6.4) a Borel map w, 
(62) w:x—> wlr) 


of X into M, such that W=w~. But then the equation WU,”W-1=U,, 
together with the relations between »” and U” as well as y’ and U’, show, 
after a brief calculation, that for almost all (gx) e Gx X, 


w(x)p"(g.g~1-x)w(g-1-2)-* = o'(g,g~)-2), 
i.e., 


w(g-xjp”(gxjwlz) t = g'(g,x) 


for almost all (g,x)e Gx X. This shows that gy’~o”. This completes the 
proof of the theorem. 


Remark. It is easy to see that if the cocycle oy’ is a coboundary, the 
corresponding systems of imprimitivity are equivalent to the Koopman 
systems. Thus, the examples of Section 5 in Chapter V give rise to systems 
(U,P) which are not equivalent to the Koopman systems. 


6. TRANSITIVE SYSTEMS 


In this section X continues to be a standard Borel G-space and (U,P) 
a system of imprimitivity based on X. We shall completely describe all the 
transitive systems. Our success is due to the detailed knowledge of co- 
cycles on transitive G-spaces which was accumulated in Section 5 of 
Chapter V. We recall that a measure class @ is said to be transitive if 
there exists 7, € X such that X—G-z, is a @-null set. In this case the 


SYSTHMS OF IMPRIMITIVITY 223 


orbit G- xo is uniquely determined by @ and we shall say that @ lives on 
G- xo. A system of imprimitivity (U,P) is said to be transitive if the measure 
class @ of P is such; if X’ is the orbit on which @ lives, we say that the 
system lives on X’. Note that if (U,P) is transitive, the measure class of 
P is ergodic, and hence P is homogeneous (lemma 6.10). If eX and 
X’=G-x, X’ is an invariant Borel set and is hence a standard transitive 
Borel G'-space in its own right. If a’ is a o-finite quasi-invariant measure 
on X’, the measure a : E — a'(H © X’) defines a measure class on X which 
is invariant and which lives on X’. Obviously it is the only measure class 
living on X’ with this property. 

Let x) € X and X’=G-x,. Suppose that G,,=G is the stability sub- 
group at 2. Let m be a representation of Go in the Hilbert space %, 
(which we have chosen once for all) of dimension n (l1<n<oo). Then m 
is a Borel homomorphism of G into M,. From the results of Chapter V 
we know that there exists a strict (G,X',M,)-cocycle po, which defines the 
homomorphism m of Go at a. We extend po to a strict (G,X,M,)-cocycle 
p, forexample, by putting 9(g,v) = 1 for x ¢ X’. We choose a o-finite measure 
œ which is quasi invariant and lives on X’. We can then construct the 
system of imprimitivity in 4, ,, associated with p and «. We shall call 
such systems, obtained by varying «, p (but «œ living on X’) systems 
induced by the representation m of Go. It is obvious that all such systems are 
mutually equivalent, homogeneous, transitive, and live on X’. We now 
prove the basic theorem of this section. 


Theorem 6.12. Let X be a standard Borel G-space and let (U,P) be a 
transitive system of imprimitivity based on X such that (i) P is of multi- 
plicity n, and (ii) the measure class of P lives on the orbit X' =G. x. Then 
there exists a representation m of the stability group Go at xo in X, such that 
(U,P) is equivalent to a system induced by m. The equivalence class of m is 
determined uniquely by the equivalence class of (U,P) and gives rise to a one- 
one correspondence {(U,P)} — {m} of the set of all equivalence classes of 
transitive systems living on X’ onto the set of all equivalence classes of repre- 
sentations of Gy. Moreover, the commuting ring of (U,P) is isomorphic to the 
commuting ring of m. In particular, (U,P) is irreducible, or a direct sum of 
irreducible systems, if and only if m is so. 


Proof. If m, and m, are inequivalent representations of Go, theorem 
5.27 shows that the (G,X’,M,,)-cocycles which correspond to them define 
distinct cohomology classes. Therefore we can conclude from theorem 
6.11 that the systems of imprimitivity induced by m, and mz, are in- 
equivalent. By the same token, if m, and m, are equivalent, so are the 
systems that are induced by m, and mg. 

Next, let (U,P) be a transitive system which lives on X’. Let « be a 
o-finite measure belonging to the measure class of P. o(X —X')=0 
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obviously. Theorem 6.11 now tells us that there exists some n and a 
(G,X,M,)-cocycle » relative to « such that (U,P) is equivalent to the 
system associated with p and a. Now, the restriction of p to Gx X’ is a 
(G,X',M,,)-cocycle on the transitive space X’ and hence by lemma 5.26 
there exists a strict cocycle g’ on G x X’ which defines the same cohomology 
class as g (restricted to Gx X’). If m is the homomorphism of G, which is 
defined by gq’, it is clear that (U,P) is equivalent to a system induced by m. 

The only thing that remains to be proved is the statement about the 
commuting rings. We assume (as we may) that X =X’. Let p be a strict 
(G,X,M,,)-cocycle defining the homomorphism m at xp of Go. We choose a 
quasi-invariant o-finite measure a on X and write W =, ,. In our usual 
notation, for each g e G and Borel ESX, 


Prf = xef, 
(Ug f(x) = {rolg 2) P plg g x) (g a). 
We use the results and notation of Section 5, Chapter V. We select a 
Borel map d(g —> d(g)) of G into M, such that d(gh)=d(g)m(h) for all 
(9,h)EGxG, d(e)=1, and 9°(g9,9')=d(gg')d(g')~-1 for all (g,g')EGxG 


(P°(9,9') = 99,9" *%o))- 
Suppose that T is an operator in # which commutes with all Pz and 
all U,. By lemma 6.4 we can choose a Borel map t, 


(64) t : x —> t(x). 


(63) 


of X into the Borel space S, of all operators in %, such that |t(x)| < |7| 
for all x and 


(65) T =t. 


If we use the fact that T commutes with all U,, we obtain, after a brief 
calculation, the identity 


(66) P(9.g~*+a)t(g-*-x) = t(x)plg,g x) 
for each g for almost all x, i.e., for each g e€ G 
(67) plg x)t(x) = t(g-x)p(g,x) 


for almost all x e X. From (67) we get, on writing ¢°(g) =t(g- 2p), 
P°(9.9' lg) = igg olg’) 


for almost all (g,g') e G x G. Substituting the expression for o° in terms of 
d and simplifying, we conclude that there exists a bounded operator r in 
X, such that 


(68) t°(g) = d(g)rd(g)~* 
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for almost all g. Replacing g by gh (h € Go) and simplifying, we obtain 
(69) m(hjrm(h)-t = r. 


This shows that 7 is in the commuting ring of m. On the other hand, if we 
start from a r in the commuting ring of m, the map g —> d(g)rd(g~") is a 
bounded Borel map of X into S, which is constant on the left Go-cosets. 
Hence there is a bounded Borel map ¢ from X into S, such that ¢° is given 
by (68) for all g. Routine calculation leads us from (68) to (67) to (66). At 
this stage, we know that T =t% is in the commuting ring of (U,P). 

The correspondence 7'’z 7 is obviously one-one and is an algebra 
isomorphism as is easily seen from the formulas (68) and (65) connecting 
T and r. Moreover, as each d(g) is unitary, (68) also implies that 


(70) 1°(g)* = d(g)r*d(g)—°, 


so that T <7 is adjoint-preserving. Since the commuting ring of a 
representation (or a system of imprimitivity) determines completely the 
circumstances under which it is irreducible or a direct sum of irreducibles, 
the proof of the theorem is complete. | 


Corollary 6.13. Let X be a standard transitive Borel G-space, x) EX, 
and Gy the stability group at xo. Then every system of imprimitivity based on 
X is homogeneous and its measure class is the unique invariant measure 
class on X. There is a canonical one-one correspondence between the equi- 
valence classes of systems based on X and equivalence classes of representa- 
tions of Go. This correspondence preserves irreducibility as well as the 
property of being a direct sum of irreducibles. 


Remark. Theorem 6.12 gives a complete analysis of the transitive 
homogeneous systems of imprimitivity based on X. It is natural to ask 
what happens when we give up the assumption of transitivity. According 
to lemma 6.10 if the measure class ¢ of P is ergodic, then P is homo- 
geneous. Theorem 6.11 then tells us that we have only to study the 
cohomology classes relative to @. However, the examples of Section 5 in 
Chapter V show that the cohomology classes relative to an ergodic 
invariant measure class are incredibly more complicated than in the 
transitive case. In view of this it has not been possible to carry out the 
analysis of ergodic systems of imprimitivity further than that of theorem 
6.11. 


We note here that if there is any ergodic invariant measure class @ on 
X, then one can construct irreducible systems of imprimitivity (U,P) 
based on X such that @ is the measure class of P. In fact, let « be a o- 
finite measure in @ and let » be any (G,X,M,)-cocycle relative to @. 
Let #=H,,, and let (U,P) be the system of imprimitivity associated 
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with » and a. We may take %4 to be the complex number space so that 
H = L?(X,a). We claim that (U,P) is actually irreducible. In fact, let T 
be an operator commuting with (U,P). By our lemma 9.4 there exists a 
complex valued, bounded Borel function ¢ such that 


(TFN) = tx) f(z) 


for all fe #. The condition that T commutes with all U, gives us (67). 
But, as t(x) is a complex number, it commutes with ¢(g,x); thus for each 
ge G, 


t(g-x) = t(x) 


for almost all x. The ergodicity of & now implies that t is a constant almost 
everywhere. T is thus a multiple of the identity showing that (U,P) is 
irreducible. In other words, the limitations imposed by the assumption of 
transitivity are quite real. 

A useful direction of investigation would obviously be that of determining 
when every ergodic invariant measure class is transitive. We formulate 
two lemmas, each of which gives sufficient conditions for this to happen. 


Lemma 6.14. Let X be a standard Borel G-space. Suppose that there exists 
a Borel st DEX which intersects each orbit in exactly one point. Then, 
every ergodic invariant class on X 1s transitive. 


Proof. The proof requires the use of the deep result (cf. remark follow- 
ing corollary 5.2) on the universal measurability of the ranges of Borel 
maps. Let « be a finite measure of an arbitrary ergodic invariant measure 
class. We normalize « so that a(X)=1. If E is any Borel set ¢ D, the set 
G-E, being the image of the standard space Gx E under the Borel map 
g, x —> g-x, is an a-measurable subset of X. We define 


(71) «~ (E) = «(Q E). 


Now the fact that D meets each orbit exactly once implies that, for dis- 
joint E, the sets GE are also disjoint. Hence, «~, as defined by (71), is a 
measure on the Borel space defined by D and a~(D)=1. We claim that for 
any Borel set Ec D, a~(E#) is either 0 or 1. If this were not so, we can 
choose a Borel set ES D such that 0<o(G-H)<1 and the characteristic 
function y of the set G-E would be an invariant «-measurable function 
which is not a constant almost everywhere. Let y’ be a Borel function on 
X which is a-almost everywhere equal to y. Then, for each g, y'(g-x)= 
y’(z) for a-almost all x. This would then contradict the ergodicity of the 
measure a. 

Thus «~ is a two-valued measure on D. Now, choose a sequence {En} of 
Borel subsets of D which separate the points of D. By replacing E, by 
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D—E,, whenever necessary, we may assume that a~(H,)=1 for all n. 
Put 


(72) By = (\ Bhi 


a~(Ho)=1. Since the sequence {E} separates the points of D, Hy must 
consist of a single point, say xo. But then a(G-2,)=1, showing that « is 
transitive. 

A variation of the argument given just now leads to: 


Lemma 6.15. Suppose that there exists a sequence {fa} of invariant real 
valued Borel functions on X which separate the orbits of X, i.e., two points 
x, y of X lie on the same orbit if and only if f,(x)=f,,(y) for all n. Then every 
ergodic invariant measure class on X is transitive. 


Proof. Let œ be a measure such that o(X)=1 and let « be quasi invariant 
and ergodic. Let Z’ be the smallest Borel structure on X which makes all 
the fa Borel. Every set in Z’ is a Borel subset of X which is also invariant. 
Hence each such set has «-measure either 0 or 1. Let {#,,} be a sequence 
of sets which generates @ such that a(#,)=1 for all n. If we put 


(73) E, = Ñ Ew 
n=1 


then a(#,)=1. Now the class of Borel sets which either contain Hy or are 
disjoint from it is a c-algebra containing all the #,. Hence every set in 
B' has this property. This implies that Ey is a single orbit; for, if x) € Eo 
and X’=G'-xp, the fact that the f, separate the orbits implies that 


(74) X = N {x : falx) = Snrl%o)}; 
and consequently that X’'eZ'; since X' ^ Eo Ø, ESX’, so that 
A’ = Ep. 

Corollary 6.16. If G is compact, then every ergodic invariant measure class 


on X 18 transitive. 


Proof. Using theorem 5.7 it is easy to come down to the case when X 
is a compact metric space and G acts continuously on X. We shall verify 
that the condition of lemma 6.15 is satisfied. By the Stone-Weierstrass 
theorem, C(X) is separable. Let {u,} be a countable set dense in C(X), and 
for u e C(X) let 


(75) a(z) = f ulg-)dulg), 


where u is the (normalized) Haar measure on G. % is continuous on X. 
Clearly each Ñ is invariant. We claim that {u,} separates the orbits in X. 
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Let x, y e X such that @,(x)=u,(y) for all n. Since {u,} is dense in C(X), 
we deduce from (75) that %(x)=U(y) for all u e C(X). Suppose now 2 and y 
were not in the same orbit. Then G2 and G-y are closed disjoint subsets 
of X, and hence there is a u e C(X) with O0<u<1 such that u=0 on G- a 
and =1 on G-y. It is obvious that 7=0 on G-x and =1 on G-y. This 
contradiction proves the corollary. 


If every ergodic invariant measure class on X is transitive, we shall say 
that X has a smooth orbit structure. 


7. EXAMPLES AND REMARKS 


We shall now discuss a number of examples. The simplest example of a 
transitive space is G itself. If (U,P) is a system based on G, it is transitive 
and its measure class is the invariant measure class of G. We now apply 
the results of Section 6. The stability group at ee G is {e} whose only 
representations are trivial. Any trivial representation is a direct sum of 
trivial one-dimensional representations. We thus obtain 


Theorem 6.17. Let G act on itself by left translations. Let # = L?(u), 
where n is a left Haar measure on G, and let P and U be defined by: 
Prf = xe-f, 
(U, f2) = f(g *2). 
Then (U,P) is an irreducible system of vmprimitivity based on G. Any 


irreducible system based on G is equivalent to this one. An arbitrary system 
of imprimitivity is a direct sum of irreducible systems. 


(76) 


Rotations in the Plane. Let X =C, the complex plane, and let G be the 
multiplicative group of complex numbers of modulus 1. We put, for z e G 
and ze (, 

Z£ = ZT. 


X is a Borel G-space. For each d >20, 
E, = {a : |z| = d} 


is an orbit and these are all the orbits. Since G is compact, all ergodic 
measure classes are transitive. For d>0, the stability groups associated 
with E, are trivial and hence, corresponding to each d>0, there exists, 
up to equivalence, exactly one irreducible system living on Hy. We shall 
now describe this. Note that E, has an invariant measure, say o,such that 
o(#,)=1. If we define the measure o, by 


fla)deg(x) = f f(dæ)do(z), 
Eq Ey 
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then o,(H,)=1 and it is the unique invariant measure living on EK, with 
this property. Let %, be the Hilbert space Y?(c,). Then, if we define 
(U$, P?) by 
P’ = xef, 
(UTE) = f(z-*a), 


(U4, P4) is the unique irreducible system living on E4. 

When d=0, #,={0} and the stability group is G itself. In this case, P 
is trivial and U can be an arbitrary representation of Œ. If (U,P) is to be 
irreducible, U will have to be a character of G. 

The example discussed just now can be generalized. Let us assume that 
X= R" and G=SO(n), the group of orientation preserving rotations. 
The orbits in X are the spheres S, of radius d>0 and the origin 0. Let us 
choose d > 0 and consider the point 


x, = (0, 0,- +, 0, d). 


Then the stability subgroup x4 is canonically isomorphic to SO(n—1). 
Since G is compact, we know from corollary 6.16 that there are no ergodic 
systems other than the transitive ones. For each d>0, there are denum- 
erably many inequivalent irreducible transitive systems living on S,; 
these correspond one-one to the equivalence classes of irreducible 
representations of SO(n — 1). 

However, when we proceed to describe explicitly the irreducible systems 
living on S,, for example, we run into a difficulty. The theory developed in 
Section 5 of Chapter V and in the present chapter directs us to start 
with an irreducible representation, say m, of SO(n—1) and construct a 
strict (G,S,)-cocycle which defines the representation m at the point 
(0,---, 0, 1). Such cocycles can be constructed, of course, but they depend 
on the choice of a section for the left coset space SO(n)/SO(n—1). Since 
canonical sections do not exist, our theory does not yield any simple 
geometric description of the unique irreducible system of imprimitivity 
living on S, which corresponds to m. 

This difficulty, although of no theoretical importance, can be quite 
irritating when we want to make any calculation. We shall next proceed 
to discuss a geometric method of writing down systems of imprimitivity 
which does not rely on the construction of sections. This method is of 
course not very general, but it covers most of the cases which occur in 
physics. 


Special Descriptions. Let X be a standard transitive Borel G-space, 
£o E X. Let Go be the stability group at 2). We assume that X possesses an 
invariant o-finite measure, say a. Let m be any representation of Gy in a 
separable Hilbert space X. We assume that there exists a separable Hilbert 
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space X” and a Borel homomorphism m’ of G into the group of invertible 
operators of X” such that 

(a) X ts a closed linear manifold of X”, 

(b) m’(h)ju = m(h)u forallheG, and ue X. 
Let |:| and ¢.,.> denote the norm and inner product in X’. Note that the 
operators m’(g) for g € Go need not be unitary. Let us consider now any 


point xe X. Since X is transitive, there exists a g e G such that g-x)=z. 
We now define the closed linear manifold #,<.%" by: 


(78) A, = m' (gi 4). 


(77) 


Equation (78) defines X, without ambiguity. We now introduce an inner 
product <.,.>, in Xz, by choosing g e G such that g-2) =a and putting 


(79) CUM, = <m (gjum (gw) (u, v E 45). 


Once again, (79) is well defined. The fact that X, and ¢.,.>, are well 
defined and that m’ is a homomorphism of G implies that 


(80) Box = m'(9)[ Az], 

(81) CM'(9)U,M'(9)0>g.2 = (UY) x 

for all (g,z)e€GxxX. In particular, m’(g) is a unitary isomorphism of 
HA, onto 4.5. 


We now introduce the vector space VY of a-equivalence classes of Borel 
maps f: 


(82) [:X> kX", 
such that 
(83) f(x) E€ 4, 


for all xe X. If f, f’eW (as usual we make no distinction between 
functions satisfying (83) and the equivalence classes which they define), 
we observe that x —> ¢f(x),f'(x)>, is a Borel function. To prove this, we 
choose a section c, i.e., a Borel map c of X into G, 


(84) c: x —> e(z) 

such that c(x)=e and 

(85) C(x)-% = x 

for all z. Then, 

(86) CF) S (t)>2 = Cm'(c(x)~*) f (£),m (e(z)? (x) 


and, as the map g —> m’(g) is Borel from G into the Borel space of operators 
on X’, the Borelian nature of x>¢ f(x), f’ (x)}a follows from corollary 6.3. 


SYSTEMS OF IMPRIMITIVITY 221 
For fe Y, we put 


(87) If? = f (f(E), f (2)yzda(2). 
Let 

(88) H = {f : |f]? < oo}. 
For f, f'e Æ. 

(89) fif = f: CFE) f(a) gee) 


is clearly finite. <.:.> is an inner product for @ We define for g e G and 
EcxX, 


Prf = xef, 
(U,f)(x) = m'(g)f(g~*-2), 
for all fe W It is clear that P,f and U,fe¥, that Px and U, are linear 


transformations of ¥Y, and that both U, and Pp leave # invariant; it is 
also clear that U,P,;=P,.,U, for all g e G, and Borel sets Ec X. 


Theorem 6.18. # is a separable Hilbert space, and (U,P), as defined by 
(90), is a system of imprimitivity in A. Moreover, the equivalence class of 
(U,P) is the one induced by the representation m of Go. 


(90) 


Proof. We shall use the section c : x —> c(x) satisfying (85) for all x e X. 
With the help of c we map # on the Hilbert space #°(X,.%,a). We define 
for fe ¥, Jf by 


(91) (Jf)(x) = m'(c(x)~*) f(z). 


Since x —> f(x), g—>m’'(g), and x—>c(x)~! are Borel maps with appro- 
priate domains and ranges, it follows that Jf is a Borel map of X into X. 
Moreover, (86) implies that 


(92) f(z), fae = IFN), (Sf\(a)>. 


J is also linear and one-one. Therefore we conclude that J is a linear 
isomorphism of # onto #7(X,%;«) which preserves the inner products, 
1.€., 
S:S = JF: I Jew). 

This proves that # is a separable Hilbert space and that J is a unitary 
isomorphism of # onto #7(X,4a). 

We then put 
V, = JU,J™?, 


(93) 
Q; = JPJ ~t. 
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(V,Q) is a system of imprimitivity based on X, acting in #7(X,%a). 
Using (90) and (91), an easy computation reveals that for fọ € £7(X,.%a), 


(94) Qef 0 7 xef 0 
and 
(95) (Vo fo)(%) = m'(c(x)~ *gce(g~*-x)) fol(g-*-2). 


Now, the element c(g-x)~1gc(x) lies in Gp because it sends xo to c(g-x)~} 
. (g-x)=Zxo. Therefore, if we define 


(96) (9%) = m(c(g-x)~*ge(zx)), 
then g, x —> ọ(g,x) is a Borel map of G x X into the unitary group M of X. 


Moreover, the fact that m’ is a homomorphism leads easily to the assertion 
that g is a strict (G,X,M)-cocycle. If x =x and g € Go, then 


(97) (9.x) = m'(g) = m(9). 
Therefore p defines the representation m at x). A comparison of (95) and 
(96) shows that 


(98) (Vofo)(x) = p(g.g~*-%) folg- x). 
Equations (97) and (98) show that (V,Q) is the system associated with m. 
Since (U,P)~(V,Q), the proof of the theorem is complete. 


We shall discuss a number of situations where the conditions (77) 
connecting Go, G, and m are satisfied for every irreducible representation m 
of Go. 


(i) G is compact. In this case, let m be an irreducible representation of 
Go. If we decompose the representation of G, induced by m, into irreducible 
components, and select any irreducible, m’ say, which occurs in the 
decomposition, it follows quite readily from the Frobenius reciprocity 
theorem (cf. Weil [1]) that the restriction of m’ to Go contains m as a 
subrepresentation. 


(ii) G is a connected, simply connected complex semisimple Lie group, 
Go compact. Here, G is known to be unimodular so that X admits an 
invariant Borel measure. From the general theory of the semisimple 
groups (cf. Helgason [1]), we know that there exists a maximal compact 
subgroup K containing Gy. Let m be an irreducible (finite dimensional) 
representation of G, acting in X. By (i) above, there exists a finite dimen- 
sional Hilbert space X” containing X and an irreducible representation 
m’ of K in X” such that m’(h)=m(h) for all h e Go. Now, it is well known 
(cf. Weyl [2], p. 267) that there exists a homomorphism m” of G into the 
group of invertible operators of X” itself such that m”(k)=m' (k) for ke K, 
and that m” is even unique if we require that the map g —> m”(g) is 
complex analytic on G. Thus the conditions of theorem 6.18 are satisfied 
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in this case. In particular, if Go itself is maximal compact in G, the systems 
of imprimitivity based on G/G, can be described in the geometric form of 
theorem 6.18. We shall do this in Chapter IX, when G=SL(2,C) and 
G,=SU(2,C), in connection with the equations for the Dirac electron. We 
thus have: 


Theorem 6.19. Let G be a connected, simply connected complex semisimple 
Lie group, and Go a maximal compact subgroup of G. Let « be an invariant 
Borel measure on X =G/G. Let m be an irreducible representation of Go in a 
Hilbert space XH, and let m'(g —> m’(g)) be the unique complex-analytic homo- 
morphism of G into the group of invertible linear endomorphisms of X such 
that m'(h)=m(h) for all he Gy. Then there exists exactly one map x —> 
<.,->2 of X into the set of positive definite inner products on X x X such 
that <.,. Xz, =<.. > and 


CU, Vr = (m (gju, m" (9)U >. x 


for all (g,z)EGxX. If X isthe Hilbert space of Borel maps f of X into X 
such that 


(99) Ifl? = f (f(E) f (at) ede) <00, 
and we define P and U by 
Pef = XeÍ, 
(U,f)(z) = m'(g)f(g7*- x), 


then (U,P) is a system of imprimitivity equivalent to the system induced 
by m. 


(100) 


Representations in Vector Bundles. The reader might have noticed that 
the essential point of the proof of theorem 6.18 is contained in the fact 
that the family of subspaces z > X, transform “covariantly” under the 
action of G. It is natural to suspect that a more general formulation of 
theorem 6.18 can be obtained in the context of such covariant families. 
This is indeed so, and the natural setup for formulating this is that of a 
vector bundle, more precisely, a Hilbert space bundle. 

Let X and B be both standard Borel G-spaces and let X be transitive. 
We assume that there is a Borel map 


(101) "m: B—>X 
of B onto X with the property that for each g € G, the diagram 
B =— B 


(102) a| |= 
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is commutative. If we write, for each g e G, D(g) for the automorphism of 
B induced by g, and x->g-x for the automorphism of X induced by g, 
(102) translates to 


(103) m(D(g)(b)) = g-7(b) 


for each b e B and each g e G. We shall assume also for simplicity that X 
admits an invariant o-finite measure, say «. 
We shall say that B is a G-Hilbert space bundle if, for each ze xX, 


(104) B, = n` *({z}) 


is a separable Hilbert space whose natural Borel structure is the one 
induced on B, by B and if, for each ge G and x e€ X, 


(105) | b—> D(g)b (be B,) 


is a unitary isomorphism of B, onto B,.,. If we write |-|, and ¢.,.>, for 
the norm and inner product on B,, (105) means 


(106) b,b z = <D(g)b,D(g)b">,.2 (b, b’ € B,). 


In particular, if G, is the stability subgroup at x, D(g) leaves B, invariant 
for g e G, and g — D(g) defines a representation of G, in B,. 
We shall now consider sections of the bundle. A section of B is a Borel 


map f, 

(107) f: x— f(x) 

of X into B such that f(x) e B, for all x e X. To handle the measurability 
problems involving these sections, we transform them into Borel maps of X 


into a single Hilbert space. Choose a point x, € X and a Borel map c from 
X into G such that c(xọ)=e and 


C(%)-% = x 


for all xe X. Since the map g, b —> D(g)b of Gx B into B is Borel, it 
follows quite simply that for any section f of B, x —> D(e(x))~ tf (x) is a 
Borel map of X into B,,. Conversely, if x —> fo(x) is a Borel map of X into 
Bz then x —> D(e(x))fo(x) is a section of the bundle B. We denote by y 
the vector space of «-equivalence classes of sections. 

For any x € X, and any section f, 


<f(2),F(@)>2 = <D(e(x))~ f(x), D(e(x))~"F (2) >: 


This shows that x —> |f(x)|2 is a Borel function on X. We shall say that f 
is a square integrable section if 


(108) Il? = f oldala) < oo 
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If # denotes the subset of Y consisting of square integrable sections, then 
H is a pre-Hilbert space under the inner product 


(109) Sif = Í Cf (2) f’ (2)yadala). 


The isomorphism, which associates with each section f, the map x —> 
D(c(x)) ~ f(x), induces a unitary isomorphism W of # onto £7(X,B,,,«). 
HA is thus a separable Hilbert space. 

For any Borel set ESX we write 


(110) Pef = xef (JE). 
For each g e G, we define the operator U, by 
(111) (U,f\(z) = Dg)f(g-*-2) (JER) 


Computations similar to those of theorem 6.18 then give rise to the 
following theorem. 


Theorem 6.20. (U,P) is a system of imprimitivity. If D? is the representa- 
tion of the stability subgroup G+, (of G at xo) in the Hilbert space B,,, then 
(U,P) is equivalent to the system induced by D°. 


As an interesting example of representations in such bundles: we 
mention the following. Let X be a C® Riemannian manifold, and let each 
g € G be an isometry of X. We assume that G is a Lie group acting transi- 
tively on X in such a fashion that g, x > g -x is a C”-map of G x X into X. 
Let B be the tangent bundlet of X. For each g, let D(g) be the differential 
of the isometry x —> g -x. Then all the conditions of the preceding theorem 
are satisfied. The system of imprimitivity thus acts on the Hilbert space 
of square integrable vector fields and is equivalent to the system induced 
by the representation of the stability group Go at x) € X defined in the 
tangent spacef to X at xo. 

In Chapter IX, we shall see how the representations associated with 
the photon can be formulated, in the spirit of theorem 6.20, as acting in 
the Hilbert space of sections of a certain vector bundle on the light cone, 
leading thereby in a very simple way to the equations of Maxwell. 


Ergodic Intransitive Systems. The last example is simply that of a 
standard Borel G-space which has ergodic nontransitive (invariant) 
measure classes. We take real numbers Aj, A2,--- which are rationally 
independent and take X to be a torus having its dimension equal to the 
number of A’s (finite or infinite). Let G=R}, and for teR! and 


(112) x = (C1, batt) (él = l for all 7) 
t Complexified. 
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in X, let 
(113) tex = (etat, eaba). 


Then, it is classical (Kronecker) that each orbit is dense and that the 
Haar measure on X is invariant and ergodic but not transitive. 


8. SEMIDIRECT PRODUCTS 


We shall now use the theory of systems of imprimitivity to obtain the 
description of all irreducible representations of an interesting class of 
lese groups which are neither compact nor abelian. This class of groups 
includes the inhomogeneous Lorentz group and the group of Euclidean 
motions as members. For the special case of the Lorentz group, the 
analysis which we shall give now was carried out by Wigner in his 1939 
paper. The general study is due to Mackey and comes out as a simple 
consequence of the work in Section 6. 

We begin by introducing the concept of semidirect products. Let A and 
H be two groups and for each h e H let 


(114) t, : a — hla] 


be an automorphism of the group A. We shall assume that h — t, is a 
homomorphism of H into the group of automorphisms of A so that 


hla] = a forall ac A if h = ey, the identity of H, 


(115) bn she = bn the (hiho E H). 
Equation (115) converts A into an H-space. We shall now make 
G=HxA 


into a group by defining the multiplication in G by 

(116) (h,a)(h’,a’) = (hh’at,{a’]). 

It is easy to verify that this definition converts G into a group with 
e= (ey,€4) as its identity (e, being the identity of A). Further, 

(117) (h,a) = (h71,h-4Ya-+)). 

G is called the semidirect product of H and A relative to t and we put 

(118) G = Hx,A. 


When no confusion can arise as to what ¢ is, we omit it, and write 
G=Hx' A. But it must be remembered that (i) A and H play very unlike 
roles in the formation of G, in contrast to the construction of direct 
products, and (ii) G can be formed only after specifying t, i.e., specifying 
how H “acts” on A. 
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Suppose now that A and H are lcsc groups and that the map 
(119) h, a —> h{a] 


is continuous from H x A into A. Then, if we equip G with the product 
topology, it follows quite easily from (116) and (117) that G becomes 
a topological group. Clearly, G is a lesc group also. We shall denote this 
topological group also by H x,A. We shall fix A, H, and t throughout 
this section. 

A quick calculation shows that 


(120) (h,a)(h’,a')(h,a)~* = (hh'h-} ah[a'lhh'h-*[a-*]). 
It follows from (120) that 
(121) A~ = {(¢y,0')} 


is a closed normal subgroup of G and that 
(122) (h,a)(€y1,0')(h,a)~* = (€y,ah[a'ja~*). 


In particular, the inner automorphism of G induced by (h,e,) coincides on 
A with the automorphism (e,,a’) > (ey,h[a’]). We put 


(123) H~ = {(h,e,)}; 


then H~ is a closed subgroup of G. Identifying A with A~ and H with H~ 
we find 


G = AH, 
(124) fh = ANA, 
h{a] = hah7?. 


The inhomogeneous linear groups are typical examples of semidirect 
products. In these examples, A is a real (or complex) finite dimensional 
vector space and a —> [a] is an invertible linear endomorphism of A. H 
is thus identified with a subgroup of the group of all invertible (linear) 
endomorphisms of A, which is usually closed. The Euclidean group and 
the inhomogeneous Lorentz group fit into this general framework. For 
the Euclidean group, A=R" and H is the group SO(n). For the 
inhomogeneous Lorentz group, 4=R‘ and H is the group of all in- 
vertible linear transformations of Rt which preserve the quadratic form 
Lo? — L1? — L2 — £3? (Xo, X1, Xo, Xz are coordinates on A). It undoubtedly 
would be of great interest therefore to develop a theory of the representa- 
tions of semidirect products. In complete generality this has not been 
done. However, under certain restrictive hypotheses, we shall construct 
a theory of representations of semidirect products which goes far enough 
to yield all physically important results. From now on, through the rest of 
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the section, we shall assume that A is a closed, abelian, normal subgroup of G, 
H a closed subgroup of G such that the conditions (124) are satisfied. 

The first step of our theory is to obtain the representations of A. The 
irreducible representations are the characters of A, i.e., continuous homo- 
morphisms of A into the group of complex numbers of modulus 1. Let 
A be the set of all characters-of A. Under pointwise multiplication, A 
becomes a group also. We shall equip A with the topology of uniform 
convergence on compacta. Then it can be shown that A becomes a 
topological group and is in fact a locally compact Abelian group satisfying 
the second axiom of countability (cf. Pontrjagin [1]). Let us then take a 
o-finite measure « on Â and consider the Hilbert space # = #?(A,a). For 
each a € A and fe # let (U,f)(x) =2x(a)f(x)(x e A), where x(a) is the value 
of the character x at a. Then a —> U, is a representation of A. These 
representations do not exhaust the representations of A. To obtain more 
general representations we replace « by a projection valued measure P 
based on A. Let # be a (separable) Hilbert space and P a projection 
valued measure based on A acting in # For each ae A let U, be the 
operator in # defined by 


CUah.S> = | wlaldryy (a), 
where 
vypg(H) = <Prf f). 
We shall symbolically write 


(125) U, = f a(a)d P(x). 


It is easy to show that U, is unitary and that a —> U, is a representation 
of A in #. Conversely, it is a known theorem that if U is any representa- 
tion of A in & there exists a unique projection valued measure P based 
on A such that 


(126) U, = f a(a)d P(x) 


for allae A. P is called the projection valued measure corresponding to U. 
The formula (126) shows how U can be constructed from P. Thus, for 
instance, it is quite clear that if T is an operator commuting with all Pg, 
then T commutes with all U,. Conversely, let T commute with all U,- 
Then 7 commutes with all the spectral projections of all the U,. Now, if 
nq is the continuous map x—> x(a) of A into the group M of complex 
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numbers of modulus 1, it follows from (126) that Q°(F — P,-1()) is a 
projection valued measure based on M and 


U, = f zdQ*(z). 


This shows that the P,-1(,) are the spectral projections of U, and hence 
that T commutes with all of them. Since the sets of the form y,~ }(F) 
generate the Borel structure of A, it follows that T commutes with P. 
The reader who wants to study the (Fourier) analysis of representations of 
A may consult Loomis [2, Chapter VIIJ. 

Let us now consider the lcsc group G and a representation U of Gina 
separable Hilbert space # Our method of analysis of U is essentially the 
Fourier analysis of the restriction of U to A. We begin this analysis by 
first constructing the “adjoint” action of H on A. 


Lemma 6.21. Let he H. Then, for each xe A there exists one and only 
one y e A such that 


(127) y(a) = x(h-*[a)) 


for all ac A. If we write y=h{x], then h, x — h[x] ts continuous from 
Hx A into A and A becomes a H-space. 


Proof. That y is unique and well defined is obvious. The remaining 
assertions are easy to prove. We omit the proofs. 


Not every representation of A in # can be enlarged to a representation 
of Gin # Our next question is: when can this be done? 


Lemma 6.22. Let U and V be representations of A and H, respectively, 
in a separable Hilbert space H and let P be the projection valued measure on 
A corresponding to U. Then, a necessary and sufficient condition that there 
should exist a representation W of Gin # whose restrictions to A and H 
are U and V, respectively, is that (_V,P) is a system of imprimitivity for H, 
based on A. In this case, W is unique. 


Proof. Let W be a representation of G in # and let U,=W, (a € A), 
V,=W,, (he H). U and V are representations of A and H, respectively. 
Now 

hah? = hja] 
so that 


(128) V,0V,7? = U ntal 


for all (haje Hx A. Let P be the projection valued measure on A 
corresponding to U. Now, a routine calculation based on transport of 
structure shows that the projection valued measure of the representation 
a —> V UaV} of A is H-+>V,P,;V,~1, and that corresponding to the 
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representation a —> U ntaj; is E —> Pr In view of the uniqueness of the 
projection valued measures which correspond to representations of A we 
infer from (128) that 


(129) ViPeVi* = Pre 


(V,P) is thus a system of imprimitivity for H based on A. 
Conversely, let us start with U, V, and P such that P and V satisfy 
(129). Then U and V satisfy (128). Define W on G by 


(130) W an = UVa- 


The relations (128) are now enough to secure the fact that W is a homo- 
morphism. Since a —> U, and h —> V „ are Borel, x > W, is Borel. W is 
thus a representation of G and its restrictions to H and A are V and U, 
respectively. Since G= AH, W is unique. The lemma is proved. 


The lemma just proved enables us to establish a one-one correspondence 
between representations of G and systems of imprimitivity of H based 
on A. 


Lemma 6.23. A representation W of G is irreducible if and only if the 
corresponding system of imprimitivity for H based on A is irreducible. 
Two representations of G are equivalent if and only if the corresponding 
systems of imprimitivity are equivalent. 


Proof. Let W be a representation of G in # and let (V,P) be the 
corresponding system of imprimitivity, also acting in # We use the 
notation of lemma 6.22. If T is an operator in #, then T' commutes with 
all the U, (a €e A) if and only if T commutes with all Pp. From this it 
follows that T lies in the commuting ring of W if and only if T lies in the 
commuting ring of (V,P). This proves the first assertion. For the second, 
let W‘ be a representation of G in #', and (V',P') the corresponding system 
of imprimitivity ({=1, 2). If T is a unitary isomorphism of #' onto #72, 
it follows from the uniqueness of the correspondence U'= P! that 
U,2=TU,'T~} for all ae A if and only if P,?=7P,'7T~1 for all Borel 
sets Ho A. From this the second assertion of the lemma follows quickly. 

The preceding lemmas and the theory of Sections 5 and 6 reduce the 
problem of describing the irreducible representations of G to the problem 
of describing the irreducible representations of the various stability sub- 
groups of H with respect to its action on A. In view of the applications to 
the study of relativistic equations we proceed to spell out the details of 
the relevant constructions. 

We choose a point z€ A and a o-finite measure « which is quasi 
invariant on the H-space A and which lives on the orbit H[z,)]=X. Let 
H,,, =H, be the stability group at xo, i.e., 


(131) Hy = {h:heH, h{xo] = xo}, 
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and let m be an irreducible representation of H, in a separable Hilbert 
space X. Let us define #=$%(A,%a) and let (V,P) be a system of 
imprimitivity for H based on A, living on the orbit X, and induced by the 
representation m. We put, for ahe G (ae A, he H) and fe Z, 


(132) (Warf (£) = x(a)(Vaf)(x) (we A). 


Since P,f=yzf for all Borel sets Ec A, it follows easily that, for the 
representation a —> U, of A corresponding to P, 


(133) (U.f)(z) = x(a) f(x) (xe A), 


so that W,,=U,V,. Thus W is a representation of G. From our work in 
Sections 5 and 6 we know that the equivalence class of the system (V,P) 
depends only on the measure class of a and the equivalence class of m. 
From lemmas 6.22 and 6.23 it follows that W is irreducible. We shall say 
that W is associated with x, and m. The following theorem is now an 
immediate consequence of our work in Sections 5 and 6 of the present 
chapter. 


Theorem 6.24 (Mackey [2] [3]). Let us choose, for each H-orbit w in A, 
one point x, on w, and an irreducible representation m, of the stability sub- 
group of H „at the point x,,. Then the representation W™:” of G associated with 
Xa and m is irreducible. W™° and W™'% are equivalent if and only if the 
orbits w and w' coincide, and the representations m and m' are equivalent. 
If the H-orbit structure of A is smooth, then each irreducible representation 
of G is equivalent to some W™, 


NOTES ON CHAPTER VI 


The theory of induced representations is originally due to Frobenius who 
studied it for finite groups. Its generalization to the category of locally 
compact second countable groups is due to Mackey. Nowadays Mackey’s 
theory is referred to as the Mackey Machine; it is also known as the little 
group method among physicists. Significant contributions were also made by 
I.M.Gel’fand and M.A.Naimark in their work on representations of 
complex classical groups (cf. Trudy Mat. Inst. Steklova, 36 (1950), 
pp. 1-288), and by F. Bruhat (Bull. Soc. Math. France, 84 (1956), pp. 97- 
205) who developed the theory of induced representations for Lie groups 
with the techniques of the theory of Schwartz distributions. Especially 
noteworthy are the various expositions of Mackey in: The Theory of unitary 
group representations in Physics, Probability, and Number Theory, Benjamin / 
Cummings, Reading, Mass., 1978; Harmonic analysis as the exploitation of 
symmetry—a historical survey, Bull. Amer. Math. Soc., 3 (1980), pp. 543-698. 

Unitary representation theory is so vast and currently so active that it is 
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not possible to begin to sketch its outlines. For some notable surveys see: 
I. M. Gel’fand, Proceedings of the International Congress of Mathematicians 
Stockholm, 1962, pp. 74-85; the articles in Harmonic Analysis on homo- 
geneous spaces, Proceedings of Symposia in Pure Mathematics, Vol. XX VI, 
Amer. Math. Soc. Providence, R.I., (ed.) C.C.Moore, 1973; and V.S. 
Varadarajan, Harmonic analysts on real semisimple Lie groups, Proceedings 
of the International Congress of Mathematicians, Vancouver, 1974, 
pp. 121-127. The theory for semisimple groups is mainly the creation of 
Harish-Chandra; for this, see Harish-Chandra, Collected Papers, 4 Vols., 
Springer-Verlag, New York, 1984; see also Harmonic analysis and repre- 
sentations of semisimple Lie groups, Reidel, Dordrecht, 1980, edited by 
J. A. Wolf, M. Cahen, and M. de Wilde. Recently, interest has intensified in 
representation theory of infinite dimensional groups; for a view through the 
theory of infinite dimensional Lie algebras see the book of Kac and the 
references therein: V. G. Kac, Infinite dimensional Lie algebras, Birkhauser, 
Boston, 1983. 


CHAPTER VII 
MULTIPLIERS 


1, THE PROJECTIVE GROUP 


Earlier in Chapter III we showed that in any mathematical description 
of a quantum mechanical system the requirement of covariance introduces 
a representation of the symmetry group G of the system into the group 
Aut(S), where S is the set of all states of the system in question. If we 
assume that the logic of the system is standard, this representation can be 
replaced by a representation of G into the group of all symmetries of the 
Hilbert space underlying the logic. If G° is the subgroup of G consisting of 
all elements in the same connected component as the identity of G, then it 
can be shown that when G is a Lie group the symmetry corresponding to 
each element of G° is a unitary rather than an antiunitary operatort. 
However, these unitary operators are not uniquely determined. Each of 
them can obviously be multiplied by a complex number of modulus 1 
(called a phase factor) without changing the induced automorphism of the 
logic. It is therefore not immediately obvious that we have a unitary 
representation of G®, i.e., that the phase factors involved are all removable. 

In this chapter we shall examine this question. We remark that it was 
first studied by Wigner [1]. He proved, among other things that, if G is the 
inhomogeneous Lorentz group, any representation of G in Aut( S) can be 
induced by a unitary representation, at least of the universal covering 
group of G. This result was the starting point for his classification of 
relativistic wave equations. Wigner’s discussion of this problem, however, 
was restricted to the Lorentz and rotation groups. It was Bargmann [1] 
who first examined these problems systematically. He obtained many 
general theorems which included Wigner’s results as special cases. 

In his work on group representations, Mackey was led to some of these 


+ If xeG is of the form y? for ye G, then the symmetry associated with x is 
necessarily unitary. Now, if G is a Lie group and Q? the connected component of the 
identity, then for any x of the form exp X (X in the Lie algebra of G), x= y?, where 
y=exp 4X. Since the range of the map X — exp X contains a neighborhood of the 
identity, it follows that every element in a certain neighborhood N of the identity 
of G is mapped into a unitary operator in a given representation. Thus the elements 
of the group generated by the elements of N are also mapped into unitary operators. 
But G° coincides with this group. 
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problems from a purely mathematical point of view. In his paper [6] he 
formulated the basic concepts of a theory of such phase factors for 
arbitrary lcsc groups and showed how the subject can be fitted into the 
general framework of the theory of group extensions. This is the point 
of view taken in this chapter. With every phase factor, or multiplier as 
we call them, we shall, following Mackey, associate a group extension of 
the unit circle by the given group and reduce the solution of problems 
involving that multiplier to problems involving this group extension. 
The subject then becomes essentially a part of a cohomology theory of 
certain types of extensions. 

Our purpose is not to develop a general theory of group extensions. 
Such a general discussion may be found, in various contexts and at widely 
differing levels, in the works of Hochschild [1], [2], Calabi [1], Moore [1], 
Mackey [7], [8], Kleppner [1], and a number of other authors. Our aim is 
the much more modest one of developing in some detail the basic concepts 
of group extensions, so as to be able to obtain the solutions to the problems 
arising in connection with the physical space-time symmetry groups. 

We might mention that Hermann Wey] (cf. [1]), has also emphasized 
the fact that it is the automorphism of the logic and not the underlying 
symmetry that is of significance in physics. Some of the facts about 
projective representations may be found in his book. 

Let # be a complex separable Hilbert space. We denote by the 
unitary group of # If we equip Y with the strong topology, we have seen 
in Chapter V that Y becomes a metrizable second countable topological 
group and that its Borel structure is standard. We denote by Z the set of 
all elements of Y which are multiples of the identity operator. Z is a 
closed, normal, subgroup of Y and is central, i.e., each element of Z 
commutes with all the elements of Y. The group 


(1) P = UZ 


is called the projective group of A. We write m for the canonical homo- 
morphism 


(2) m: U—>n(U) (UE) 


of Y onto F. Theorem 4.27 now tells us that the subgroup of the group of 
automorphisms of the logic of #, consisting of automorphisms which can 
be induced by unitary operators, is isomorphic to 2. 

We shall equip Z with the quotient topology. Thus, a set ACF is 
defined to be open if and only if 7~1(A) is an open subset of X. It is easy 
to show that Z becomes a metrizable topological group satisfying the 
second axiom of countability and that 7 is an open continuous map of WY 
onto Z If we write 


(3) T = {z:zeE0, |z| = 1}, 
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then the kernel of z is precisely the set of operators of the form zl (z e T), 
and is hence a compact group. In all statements involving the topology 
and Borel structure on 2 we shall refer only to the above-described 
topology of F and its natural Borel structure. 


Lemma 7.1. The following statements concerning a sequence {U 4}n-01.3,- 
of elements in Y are equivalent: 


(i) [CUS >| > KUSS forall ff EX 
(il) there exists a sequence {z,} of complex numbers of modulus 1 such 
that z,U, > Uo in Y, 
(iii) m(U,)—>a7(U,) nF 


Proof. (ii) = (iii) obviously. On the other hand, for fixed f, f EX, 
U —> |<Uf,f'>| is a continuous function on Y which is constant on each 
Z -coset. Hence, by virtue of the fact that Z has the quotient topology, 
it defines a continuous function on Z The implication (iii) > (i) follows 
from the assertion that this function is continuous on Z We shall complete 
the proof of the lemma by showing that (i) = (ii). 

We begin the proof with a simple remark. Suppose U is an operator of 
H such that (CUS f> =|<ff| for all f, f'e Æ. Then U=t1 for some 
te T. In fact, for any f, Uf e {f}++, so that Uf is a multiple of f. Since this 
is true for each f, U must be a multiple of the identity, U =t1. It is 
obvious that |t|=1, and thus te T. 

This said, we proceed to show that (i) = (ii). Replacing U, by U,U)7~+* 
we may assume that U,=1. Since ||U,|| <1, there exists an operator U 
with ||U|| <1 and a subsequence {U,,} such that ¢(U,,f,f’> > (Uf,f> for 
all f, f € # (Dunford-Schwartz [1]). From our remark in the previous 
paragraph, we conclude that U=#1] for some te T. Since each U, is 
unitary, U,,f— tf for each f so that U„, —> tl. Since we can apply this 
argument to any subsequence of {U}, it follows that {U,, Uo,---} is a set 
with compact closure and that its only limit points are of the form 
tl (te T). Choose a unit vector f and write (for sufficiently large n) 
z= CU, f, f>—"|(Onff>|. We claim that 1 is the only limit point of the 
sequence {z,U,}. If Zn Un, > V, then V=tl for some ¢e 7’, and as 
<2n,Un, Jo J> is real and >0 for all sufficiently large k, t=1. Since 
{21U 1, 2.U2,---} has compact closure, we see that z,U, — 1. 

For f, f'E # and Ve Y, we write d; ,(7(V))=|<Vf,f’)|. 


Corollary 7.2. The topology for P is the smallest one with respect to which 
the maps d, , are all continuous. If Z is a Borel space and z —> U(z) is a 
map of Z into U, then z —> n(U(z)) is a Borel map of Z into P if and only if 
z —> |(U(z)f,f'>| is Borel for all f, f) E Z. 


Proof. Let 7 be the smallest topology in question. We have already 
seen that |(Vf,f>|=|(Wff’>| for all f, f'E # if and only if n(V)=7(W). 
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JZ is hence Hausdorff. Let D be a countable dense subset of # Then an 
easy argument shows that 7 is also the smallest topology with respect to 
which the maps d; p (f, f €e D) are continuous. Hence J is metrizable 
and lemma 7.1 now implies the first statement. The second statement is 
now clear. 


Lemma 7.3. Let fe # be nonzero and let 
(4) Q, ={U:U EU, <Uf,f> is real and > 0}. 


Then Q, is a Borel set in Y containing 1 which meets each Z -coset at most 
once. Moreover, z[Q,] is open in F and v is a homeomorphism of Q, onto 


n[Q;]. 

Proof. That Q, is a Borel set containing 1 is obvious. If <Uf,f>>0 and 
te T, <tUf,fy>0 if and only if t=1. Thus Q, meets each Z -coset at most 
once. On the other hand, if (Uf,f)>=t40, then for V=|t|t-'U, one has 
<Vf,f>=|t| >0. This proves that 


(5) am *(7[Q,]) = {U:U EU, <Uf.f> + 0}. 


Equation (5) shows at once that 7[Q,] is open in Z It remains to prove that 
a is a homeomorphism on Q,. Let {Uj}, =0,1,... be a sequence in Q, such that 
a(U,,)>7(U,) in Z. By lemma 7.1 there exists a sequence {t,,} in T such 


that t,U, > Uo in Y. Since |CU,f,f>|=<(Unf, f> > |CU of. f>| =U off): 
it follows that {t,} itself converges to 1. This shows that U, — Uo. As v is 
continuous, this proves that r is a homeomorphism and completes the 
proof of the lemma. 


Theorem 7.4. F is a standard Borel group. There exists a Borel map c 
from P into «U, 


(6) c:x%—->c(x) 
such that 
(i) a(c(z)) =x forall xe? 
(7) (ü) e(w(1)) = 1, 
(iii) there is an open set containing n(1) on which c ts continuous. 


The range of c is a Borel subset of U which meets each Z -coset exactly once. 
Finally, if f is a map of P into some Borel space X, f is Borel if and only if 
the map U —> f(n(U)) of Y into X is Borel. In particular, ASF is Borel if 
and only if 7~1(A) is a Borel set in Y. 


Proof. We begin our proof with the construction of c. Let {f1, -s Jate 3} 
be a countable dense set of nonzero vectors and let (cf. (4)) 


(8) Qn = Q%,- 
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We now define sets E, E2,- SU as follows: 


(9) Ey = Qi, 
and for n>], 
(10) E, = Qa A 1) {0 : Uff = 0} 
Let 
(11) E = UE, 
n=1 


E is a Borel subset of %. We claim that E meets each Z -coset exactly 
once. In fact, let V € Y and let m be the smallest of integers 7 such that 
< Vf fo 49; it is then obvious that V2 A H=VZ A E,, consists of one 
element. £ is thus a Borel set meeting each Z -coset exactly once. 

The canonical map ~ is thus one-one on E and maps E onto F Since F 
is a separable Borel space and £ is standard, the image of any Borel 
subset Fc # under v is a Borel set in Z Hence ~v is a Borel isomorphism 
on E. In particular, Z is a standard Borel space and the map c, which 
inverts the restriction of 7 to E, is a Borel map of Z into Y such that 
a(c(x)) =2 for all x e Z We now observe that E, =Q, and 1 e £. Hence we 
use lemma 7.3 to conclude that 7[£,] is open in F and that c is continuous 
on r[ E]. l 


2. MULTIPLIERS AND PROJECTIVE REPRESENTATIONS 


After these preliminaries involving the projective group, we are in a 
position to introduce the basic concepts centering around the so-called 
projective (or ray) representations of a group. We begin with the concept 
of a multiplier. 

Let G, K be lesc groups with K abelian. By a K-multiplier for G we mean 
a Borel map m 


(12) m : x, y —> m(xy) 
of Gx G into K such that 


(13) (i) m(z,yz)m(y,z) = m(xy,z)m(x,y) forall 2x, y,zEG, 

(ii) m(x,e) = m(e,x) = 1 forall xeG. 
Note that we are writing K multiplicatively and that we write 1 for the 
identity of K. If K=T, the multiplicative group of complex numbers of 
modulus 1, we omit the prefix K and speak simply of multipliers. If K is 
additively written, the equations (13) become 


(14) (i) m(xy,z)+m(x,y) = m(x,yz)+m(y,z) forall 2, y,zeG, 


(ii) m(x,e) = mez) =0 forall wveG. 
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It follows from (13), on putting y =x71, z=a, that 

(15) m(x,0-1) = m(x t,x) 

for all x e G. 


The set of all K-multipliers for G is obviously a commutative group 
under pointwise multiplication. We denote this group by M,;’(G). Two 
K-multipliers m,, Mmo for G are said to be similar, m, ~m, in symbols, if 
there exists a Borel function a on G with values in K, 

a : x —> alx) 
such that 
a(xy) 
16 m(x,y) = ——— m(x, 
( ) 2( y) a(x)a(y) i( y) 
for all x, y e G. Note that a(e) is necessarily equal to 1. If a K-multiplier m 
is similar to the multiplier 1 : x, y —> 1, we shall say that m is exact. Thus m 
is exact if and only if for some Borel map a of G into K, 


a(xy) 
17 MIX, = meee 
a7 Ca) = ajala) 
for all x, y e G. From (17) it is easily calculated that the exact K-multi- 
pliers for G form a subgroup Eg(G) of M,’(G). We form the quotient 
group 
(18) M,(G) = Mx" (G)[E'x(@) 
and call it the K-multiplier group of G. When K=T we write M(G) for 


M,(G) and call it the multiplier group of G. 
Suppose that m is a multiplier for G. A mapping 


U :g— U, 
of G into the unitary group X of a separable Hilbert space # is said to be 
an m-representation if 
(i) g—> U, is Borel, i.e., g—><U,f,f is Borel for all 
JL €, 
(ii) U= 1, 
(iii) U,, = m(z,y)U,U, forall 2, yeG. 


(19) 


A Borel mapping g —> U, of G into Y is said to be a projective representation 
if there exists a multiplier m for G such that U is an m-representation; 
then m is obviously uniquely determined by U. We shall say that m is the 
multiplier of U. If U is a projective representation of G in 4 the map 


(20) moU :g-—>7(U,), 
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of G into the projective group Ž is Borel and (iii) of (19) shows that 
rmo U is a homomorphism. Conversely, we shall prove that every Borel 
homomorphism of G into F arises from a projective representation in 
this fashion. 


Theorem 7.5. Let m be any multiplier for G and let U be any m-repre- 
sentation of G acting in H. Then n o U is a continuous homomorphism of G 
into P If V is any projective representation of G in # such that wo U = 
mo V, then the multiplier of V is similar to m; and conversely, if m is a 
multiplier similar to m, there is an m'-representation V with no V=ro U. 
Suppose 


U : J —> Ug 


is a Borel homomorphism of G into F. Then there exists a projective representa- 
tion U of G in # such that 


(21) u=ro U; 


moreover U can be chosen to be continuous in some open subset of G containing 
1. Finally, if m is any multiplier for G, there exist m-representations of G. 


Proof. Let U be a projective representation of G in # Then x > x(U,) 
is a Borel map of G into Z Since U,, is a constant multiple of U,U,, 
"(U zy) =7(U,)7(U,). Thus ro U is a homomorphism of G into Z By 
lemma 5.28, mo U is continuous. Suppose U’ is another projective repre- 
sentation of G in # and let m and m’ be the multipliers of U and U’, 
respectively. Then m o U =v o U’ if and only if for each x e G there is an 
alx)e T such that U,’=a(x)U,. The equation U,~1U,,'=a(x)1 shows 
that the function a(x — a(x)) is Borel. If such an a exists, then a direct 
calculation shows that 


(22) m' (x,y) = may) E2 


a(x)a(y) 
for all x, y e G. Thus m~m’. Conversely, suppose m~m’. Then (22) is 
satisfied for a suitable Borel function a on G with values in T and, by 
setting U,’=a(x)U,, we obtain an m’-representation U’ with wo U'= 
moU. 
Next, let 


uU : g —> Ug 


be a Borel homomorphism of @ into P; u is continuous by lemma 5.28. 
We select a Borel map c of into Y satisfying the conditions (i) to (iii) of 
(7). Let 


(23) U, = c(u,). 
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Since c is Borel and U =c o u, U(g —> U,) is a Borel map of G into X. Let 
Ac be an open set containing (1) on which c is continuous. Then U 
is continuous on the open subset u~1(A) of G containing 1. If x,y eG, 
then it is obvious that there exists an m(z,y) € T such that 


(24) Us, = m(xy)U,U,, 


Since 
UU, tU! = m(z,y)l, 


it follows that x, y—> m(z,y) is a Borel map from G& xG into T. The 
equation u,U,,=Uz,u,, expressed in terms of the U,’s, gives (i) of (13), 
while the relations Uye = uer =, lead to (ii) of (13). m is, in other words, a 
multiplier for G, and U is an m-representation, continuous around e. 
Finally, let m be an arbitrary multiplier for G. We write # = #7(G,n,), 
where u, is a right Haar measure on G. For any x e G and f € % we define 


(25) (U,f)(g) = m(g,x)~*f (gx). 
It is readily verified that U,=1, U,,=m(z,y)U,U,. The equation 
|| Uf ||? = || f |? shows that each U, is unitary. If f, f' e Æ, 


(Uff) = Í m(g,x)~4f(gz)f'(g)*du,(g), 


and as the integrand is a Borel function on Gx X, x > (U,f,f’> is a Borel 
function on G. U is thus an m-representation. The proof of the theorem is 
complete. 


Corollary 7.6. Let m be any multiplier for G. Then m is similar to a 
multiplier which is continuous on some open subset of GxG containing 
(e,e). 

Proof. Let U be an m-representation of G, acting in a Hilbert space 2. 
From theorem 7.5 we know that there exists a multiplier m’ similar to m 
and an m’-representation U’ acting in # such that (i) mo U'=70 U, 
(ii) for a suitable open subset N of G containing e, x —> U,’ is continuous 
from N into Y. Let N, be an open set containing e such that V,N,CcN. 
As 

U,, = m (z,y)U,'U,’, 


and as U’ is continuous on N, it is clear that m’ is continuous on N, x Nj. 


Corollary 7.7. Let m be a multiplier for G and U any m-representation 
of Gin H. Then m is exact if and only if there exists a representation U’ of 
G in # such that wo U'=70 U. If U” is another representation of G in 
KH such that ro U"=70 U, there exists a continuous homomorphism y 
of G into T such that 


(26) U," = XU 
for allgeG. 
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Proof. The first assertion is an immediate consequence of theorem 7.5. 
For the second, we note that Uj~*U,"= y,-1 for each geéG, where 
x, E T. Then g — x, is Borel and multiplicative. Therefore y is a continuous 
homomorphism of G into T. 


3. MULTIPLIERS AND GROUP EXTENSIONS 


The obvious problem now facing us is of course that of finding, for given 
lesc groups G and K (K abelian), the K-multiplier group M x(G). If, for 
example, K=T and M(G) is trivial (as it may well be), then we know 
that every multiplier for G is exact. In this case there is no practical 
difference between projective and ordinary representations. Our aim is to 
examine this problem at least for special classes of G and K. We introduce 
in this section a very useful technique to study the multipliers for a given 
group, namely the method of group extensions (cf. Mackey [6)). 

Suppose G and K are two lese groups. We shall, as always, assume that 
K is abelian. By an eztension of K by G we mean a triple (H,7,7), where H 
is a lesc group, 2 is an isomorphism of K onto a closed normal subgroup of 
H, and j is a homomorphism of H onto G with kernel i[K]. We require that 
t be a homeomorphism and that j be continuous. (H,i,ĵj) is said to be a 
central extension if i[K] is contained in the center of H. In the language of 
cohomology theory, (H,2,7) is an extension of K by G if and only if the 
sequence 


(27) 0— K = H — G — 0 


is exact. If (H,i,j) and (H’,i’,7') are two extensions, then we say that they 
are equivalent, if there is an isomorphism q of H onto H’, such that, the 
diagram 


AL, 
a N 
K q G 


(28) 

H' 
commutes, i.e., for all ke K, 
(29) q(v(k)) = i(k), 
and for allhe H, 
(30) j'a) = j (h). 


Once again we require that q be a homeomorphism, i.e., g is an isomorph- 
ism of the lesc group H onto the lese group H’. We recall that for two lese 
groups A and B, any continuous homomorphism of A into B, which is 
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one-one and maps A onto B, is necessarily a homeomorphism (cf. 
Pontrjagin [1}). 

We shall now associate with any central extension (H,i,7) of K by G, a 
K-multiplier for G. Since H is lcsc and i[K] is a closed subgroup, there is a 
Borel section c based on G, i.e., a Borel map c of G into H, 


(31) c: G —> H 

such that c(e)=1(1), the identity of H, and for all x e€ G, 
j(clx)) =x 

(cf. theorem 5.11). Therefore, the map 

(32) 2, k —> c(x)i(k) 


is a one-one Borel map of the standard Borel space G x K onto the standard 
Borel space H. Consequently, it is a Borel isomorphism. We denote by d, 
the inverse of this map, which is a Borel isomorphism of H onto Gx K: 


(33) d : c(xji(k) > x, k. 
If x, y € G, c(xy) and c(x)c(y) belong to the same i[K]-coset of H, so that 
(34) m(x,y) = t~*[e(xy)~*e(x)c(y)] 


is an element of K. Clearly m is a Borel map of G x G into K and m(z,e) = 
m(e,x)=1 for all x e G. If we now remember that 2[m(x,y)] commutes with 
all elements of H, we find, for x, y,z€G, 


i[m(x,yz)m(y,2)] = c(xyz)~ *e(x)c(y)e(z) 

c(xyz) ~ *e(xy){e[m(x,y) ]}c(z) 
c(xyz)~ *e(xy)c(z)t[m(x,y)] 
um(xy.z)m(x,y)]. 


Thus m is a K-multiplier for G. We note at the same time that 
(35) c(ax)i(k)c(y)i(k’) = c(xy)i[m(x,y) kk] 


for all x, ye Gand k, k’ e K. If we use the fact that d is a Borel isomorph- 
ism between the Borel spaces H and Gx K, then (35) tells us that the 
definition of a product operation in Gx K given by 


(36) (x,k)(y,k') = (xy,m(x,y)kk’) 


converts Gx K into a Borel group, and that d provides a Borel group 
isomorphism of this group with H. 

The converse to the result described just now is a much more difficult 
question. Is it possible to associate with each K-multiplier for G a central 
extension of K by G? The answer is affirmative as the following theorem, 
due to Mackey [6], shows. 
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Theorem 7.8. Let m be any K-multiplier. If we define for (x,k), (y,k')e 
Gx kK, 
(37) (x,k)(y,k') = (xy,m(xy)kk’), 
then Gx K becomes a standard Borel group under this product, and there 
exists a unique lesc topology for Gx K such that (a) Gx K becomes a lese 
group, denoted by Gx,,K, under this topology, and (b) the Borel structure 
of this topology coincides with the product Borel structure on Gx K. If 
to(k) = (e,k), 
j o(9,4) = 9; 
then (G XmK, io, Jo) îs a central extension of K by G. Every central extension 
of K by G is equivalent to one such. The extensions corresponding to the 
K-multiplers m, and m, are equivalent if and only if m, and m, are similar. 


(38) 


Proof. We begin with a central extension (H,1,7) of K by G and associate 
with it a K-multiplier m, using the development contained in the equations 
(32) through (36). Then 

d : c(x)i(k) —> (x,k) 
is an isomorphism of H onto G x,,K, where G x,,K is considered a group 
with (37) defining the product operation. G x,,K is obviously a standard 
Borel group and is also lesc if we equip it with the topology which makes d 
a homeomorphism. It is obvious that the extensions (H,i,j) and 
(G x5, tos jo) are equivalent. 

Conversely, let m be a K-multiplier for G. Then under (37), Gx K 
becomes a standard Borel group, as is readily verified with the help 
of the relations (13). We write Gx,,K for this group. The relations 
m(x,e)=m(e,z)=1 show that {(e,k):keK} is a central subgroup of 
G x,,K. Clearly i9(k — (e,k)) and jo((g,k) —> g) are Borel homomorphisms. 
We shall next prove that if À is a Haar measure on K and pis a left Haar 
measure on G, u x A is a left invariant measure on G x,,K. In fact, if f is a 
p x A-summable Borel function on G x K and (goko) E Gx K, we have, by 
Fubini’s theorem, 


Í f f((Gorko)(gsk))d(e x ANg, k) Í Í f (gog mlgo,g)kok)d(u x A)(g,k) 


GxK GxK 


= f ( f, f (aoa k)AX(H))dulg) 
=f (f f (gag RYdu(o)) aN 


[([ tomauc)anw 
f(g kdlu x ANg:k). 


K 


G 


x 
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p x À is thus left invariant. Theorem 5.41 now enables us to conclude that 
Gx,,K becomes a lcsc group under its Weil topology. The Borel structure 
of this topology is by choice the product Borel structure on Gx K, and it 
is the only lesc topology with this property, as proved in that theorem. 
The map 
j 0°9, k —> g 

is a Borel homomorphism of the lesc group G Xm K onto the lese group G 
and is thus continuous. This proves that i.[K]=kernel (jo) is closed in 
G x,,K. A similar reasoning shows that i) is continuous also. Therefore 
(G x,, A; to, Jo) is a central extension of K by G. 

The only thing that remains to be proved is that the extensions 
Gx,,K and @Gx,,K are equivalent if and only if m, œm. For any 
K-multiplier m, let us write 


(39) Hn = OX,_)K. 
Suppose m,~m,. Then there exists a Borel map a, 
a : g —> alg) 
of G into K such that 
_alzy) 


Molz, y) = m (£Y) ———— alejalg) 


for all x, y e Gx G. If we write 
(40) q(g,k) = (g,a(g)k), 


then it is trivial to check that q is a Borel group isomorphism of Hm, onto 
H m, Since H,,, and H,,, are lesc groups, q is necessarily a homeomorphism. 
It is obvious that q sets up an equivalence between (Hm,t0o:Jo) and 
(H mai0o:Jo). Conversely, let us suppose that Hm, and H„, are equivalent 
extensions. Then there exists a Borel isomorphism ¢ of H,,, onto Hm, such 
that 

q(e,k) = (e,k) 
for all ke K, and 

jo(a(g,*)) = 9 
for all g e G. Let a(g) € K be defined by 


(41) q(9,1) = (g,a(g)). 
a is clearly a Borel map of G into K. Then, as (g,1)(e,k) =(g,k) we find 
q(9,k) = (g,a(g))(e,k) (in H ma) 
= (g,a(9)k). 
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If we now use the fact that q preserves the group structures, we can 
conclude, after a brief computation, that 


M(x,y) = Mm, (X,Y) sae 


for all x, y e Gx G. In other words, m, ~m,. The theorem is completely 
proved. 


Remark. The essential difficulty of the theorem proved above resides 
in the fact that when the K-multiplier m is not continuous on G x G, the 
product topology would no longer serve to make G x,, K lesc. In fact, if m 
is continuous on Gx G, it is obvious that under the product topology, H m 
is a topological group and is even a lesc group. Since the product topology 
generates the product Borel structure, it follows from the uniqueness of 
the Weil topology that the two topologies coincide. Conversely, if the Weil 
topology for H,, coincides with the product topology, the map x — (7,1) 
of G into H,, is continuous and hence x, y — (2,1)(y,1) = (xy,m(x,y)) is also 
continuous, showing that m is continuous. Note that, in this case, 
Co : x — (x,1) gives a global continuous section for H,,/to[K], i.e., Co is 
continuous and 4,(co(x))=x for all xeG. This shows that, by using 
continuous K-multipliers, we can construct only those central extensions 
(H,i,j) of K by G for which there exists a continuous section c from G into 
H. There are many examples where such global sections do not exist. 
For instance, if K is finite but both H and G are compact and connected, 
no continuous section for Hfi[K] can be defined on all of G. An example of 
this is obtained when we take H to be SU(n,C€), the special unitary group 
in n-dimensions, and K as the center of H. 

On the other hand, one can impose certain conditions on A under which 
any K-multiplier for G will be similar to one which is continuous in some 
open neighborhood of (e,e) in Gx G. Corollary 7.6 tells us that this is so 
when K = T. More generally, it can be shown that this is the case when K 
is a compact Lie group, or when K and G are both Lie groups. We shall 
call a K-multiplier locally continuous if it continues on some open set 
containing (e,e) in Gx G. The assumption of local continuity for m is of a 
much less serious nature than the assumption of global continuity. 

When m is locally continuous, it is almost obvious that one can con- 
struct, using local mappings, a fiber space topology for H,, over G which is 
locally trivial and which coincides with the Weil topology. We shall derive 
this as a corollary of theorem 7.8. The result is stated in corollary 7.10. 
We need an auxiliary result. 


Corollary 7.9. Suppose m is locally continuous. Then there exists an 
open set N around e such that x —> (x,1) is a homeomorphism of N into Hy. 
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Proof. We select a symmetric open set A, containing e with compact 
closure such that m is continuous on A, x A, and select A,, A3,--+ such 
that each A, is open, symmetric, contains e, and 4,,,4,4,:¢A4, for all n. 
Let {x,} be a sequence in Ag, let x, € Ag, and let x, —> a>. We shall show 
that (x,,1) —> (%9,1) in the Weil topology of Hm. Write y,=2,)~1z,. Then 
Yn E Ao, Yn —> e, and 


(42) (£o1)(Yn:L)(em(£0 Yn) *) = (x,,1). 


Since the map k — (e,k) is continuous from K into H,, and since m is 
continuous on A, x A,, (42) shows that (z,,1) will converge to (%p,1) 
provided (¥,,1) — (e,1). 

Consider now A, x K =j)~1(Aq). This is an open subset of H,, containing 
(e,1) and (y,,1) belongs to it for all n. In order to prove that (y,,1) — (e,1) 
it is enough to prove that ||L,,.f—f||—>0 for all f in ¥?(H,,) which 
vanish outside 4, x K. Let D be the set of all functions on Gx K, con- 
tinuous with respect to the product topology and vanishing outside 
subsets of A, x K which are compact in the product topology; it is enough 
to prove that Ly, +f — f for all fe D. 

Fix f e D and let C S A, and K, cK be compact sets such that f vanishes 
outside C x K,. Now, 

bt"), 


(Lid (,k) = (unt MY, Yn) 


If x éC, y,~ tx ¢C for sufficiently large n and (Ly, 1f )(x,k)=f(x,k)=0 for 
sufficiently large n. If x € Ay, the continuity of m on A, x A, implies that 
(Liy,.wd)(2,4) —> f(x,k) for all k e K. Now, |f(x,k)| < B for all (x,k) eG x K 
and so, |( Ly, »f)(2,4)| < B for all n and all (x,k). Further, if L,=m(A,xA;) 
then L, is compact in K and, for all n, Ly, 1f vanishes outside A, x 
KL, L, ~+ which is compact in the product topology. Hence, by Lebesgue’s 
convergence theorem, 


(43) [font Saux.) > 0, 
GxK 
We have thus proved that x —> (x,1) is continuous from A, into Hp. 
As Ag is compact, it is a homeomorphism. This finishes the proof of the 
corollary. 


Corollary 7.10. If m is continuous around (e,e), there exists an open set 
N containing e such that the Weil topology on the open set N x K coincides 
with the product topology. 


Proof. Let N be an open set around e such that 2 — (z,1) is a homeo- 
morphism of N into Hm. Since (x,k)=(z,l)(e,k) and since x — (z,1) and 
k —> (e,k) are both homeomorphisms, the corollary follows at once. 
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Remark. Corollary 7.10 shows that when m is locally continuous, H n, 
as a fiber space over G, has a locally trivial fibration. For a more intensive 
study of the topology of H,, we refer the reader to the work of Calabi [1]. 


The main source of technical complications in our theory is the fact that 
the topology on H,, in general bears no visible relation to that of Gand K. 
We now describe another corollary to theorem 7.8 which gives an example 
of the nature of the indirect arguments needed in analyzing Hm. 


Corollary 7.11. If K and G are both connected (resp. simply connected), so is 
Hm If K and G are compact, so ts Hm- 


Both the assertions follow from the fact that i [K] is a closed subgroup 
of H,, and G is isomorphic to H n/iol K]. 

Our first use of these group extensions is a description of the similarity 
classes of K-multipliers. Let m be a K-multiplier for G, and H,, the corre- 
sponding group extension. For any Borel map a of G into K witha(e) = 1, 
let 


Cq(x) = (x,a(x)), 


a(x)a(y) 
a(xy) 


It is easily seen that m,, =m,, if and only if there is a Borel homomorphism 
k of G into K such that a,(x)=k(x)a,(x) for all xv e G. The map ca > ma 
establishes a correspondence between multipliers similar to m and Borel 
sections, and reduces the analysis of the similarity class of m to the 
analysis of the geometry of H,, as a fiber space over G. 


(44) 
Maly) = m(x,y) 


Theorem 7.12. c, is continuous (respectively continuous around e) if 
and only if m, is continuous (respectively continuous around (e,e)). m is 
exact if and only if there exists a Borel section c which is a homomorphism of 
G into Hm 


Proof. Regarding continuity we shall prove only local assertions; the 
global ones follow from similar reasoning. Suppose now c, is continuous 
on an open set N around e. Choose an open set N, around e such that 


N N EN. Then the map 

(45) q © £, Y —> Cal£Y)Ca( Y) Cale)? 

is continuous from N, x N, into Hm. A simple calculation yields 
(46) q(z,y) = (e,m,(£,y)~?). 


Since k — (e,k) is a homeomorphism, (46) shows that m, is continuous 
on N, x N. Conversely, let m, be continuous on N x N for a suitable N. 
If we form the group Hm, we know from corollary 7.9 that d:x—>(x,1) 
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is a homeomorphism of an open set N S&N containing e into Hm, Now, 
from the relation between m and m,, it follows quickly that 


p : x, k —> x, ka(x)-? 


is a Borel group isomorphism of Hm on Hm, Hence p is a homeomorphism. 
Therefore, the map p~!od is a homeomorphism of N, into H,,. But 
(p~? o d)(x)=(x,a(x))=ca(x). Therefore c, is a homeomorphism of N, 
into Hn. 

Now we come to the question of exactness. By very definition, m is 
exact if and only if for some a, m,(x,y)=1 for all x, y e G, i.e., 


__a(ay) 
a(x)a(y) 

for all x, y e G. A trivial computation shows that (47) is true if and only if 

x ->(x,a(x)) is a homomorphism of G into Hm. The proof of theorem 7.12 

is complete. 


(47) m(x,y) 


A trivial reformulation of the second half of this theorem gives the 
following corollary: 


Corollary 7.13. m is exact if and only if there exists a subgroup G’ of 
H,, such that G’ is a Borel set meeting each 1,[K]-coset exactly once, t.e., 
GQ’ A 1[K]={(e,1)} and G’-1.[K]=H,,. 


We shall now make a brief digression by obtaining some more corollaries 
which may serve as illustrations of the use and applicability of the 
preceding theory. 


Corollary 7.14. Let G be connected and abelian. Then, for a K-multiplier 
m to be exact, it is necessary and sufficient that m(x,y)=m/(y,x) in some 
neighborhood of (e,e). 


Proof. That an exact multiplier is symmetric (when G is abelian) 
follows at once from (17). Conversely, let N be an open set containing e 
such that m(x,y)=m(y,x) for all «,yeNxWN. Then, the formula for 
multiplication in H,, shows that (x,k) and (y,k’) commute whenever x 
and y belong to N. Since G and K are connected, so is H,, and hence the 
elements of the form (z,k) (x e N) generate Hm. This shows that H,, is 
itself abelian. Since K is isomorphic to 7” x Vs, it follows from the struc- 
ture of locally compact abelian groups that {e} x K is a direct summand of 
H,,. Corollary 7.13 now implies that m is exact. 


Corollary 7.15. There exists a multiplier m’ similar to m such that m' is 
locally bounded as a map of Gx G into K (cf. Section 5 of Chapter V). 


Proof. Let a be chosen so that c, is a regular Borel section, i.e., x > 
(x,a(x)) is a map of G into Hm which sends compact sets of G into sets with 
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compact closure (in Hm). Defining q as in (45), it is obvious that q maps any 
compact subset of G x G into a subset of Hm which has compact closure. 
The formula (46) tells us now that m, is locally bounded. 


We shall end this section with an application to the theory of projective 
representations. 


Theorem 7.16. Let m be any multiplier for G and let U(g—>U,) be an 
m-representation of G in a (separable) Hilbert space Æ. Then (x,k) > kU, 
is a representation of H,, in H. Conversely, if (x,k) > Viz, 18 a representa- 
tion of Hm such that Vie =k 11 for all ke T, and if we write U z= Viz,» 
then U(x —> U) is an m-representation of G and V(x =k tU n 


Proof. The proof needs only minor calculations and is omitted. 


Corollary 7.17. Let m be any multiplier for G. Then there exists an irre- 
ducible m-representation of G. 


Proof. Let V be a representation of H,, such that Vi. ,.=k7~11 for all 
keT. Such V exist, by theorem 7.5, because G admits m-representations. 
Let us decompose V into a direct integral of irreducible representations V, 


(48) V x | Vede(e) 


(cf. Mautner [1], Mackey [5]). Then, from the theory of the direct integrals 
of representations, it is easy to see that for some £o, Vi9,,=k711 for all 
ke T (in fact x-almost all € will have this property). Write V° = Vo, and 
let U? be the corresponding m-representation of @. Since VO, .=k7'U,°, 
the irreducibility of V° implies that of U°. 


Corollary 7.18. If G is compact and m is a multiplier for G, then there 
exist finite dimensional m-representations of G. The irreducible m-representa- 
tions of G are finite dimensional and every m-representation of G is a direct 
sum of irreducible m-representations. 


Proof. We observe that H,, is compact (corollary 7.11). The present 
corollary now follows in exactly the same fashion as the previous one. 
The proof is even more elementary, since only direct sums, rather than 
direct integrals, of representations, are involved. 


4. MULTIPLIERS FOR LIE GROUPS 


We shall now make a deeper study of the group extensions H,, under the 
assumption that K and G are both connected Lie groups. Since K is 
abelian, it would be a direct product of a real vector group and a torus. 
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The first basic result of this section tells us that H,, becomes a Lie group 
in a natural manner. The way is then opened for the application of 
results of Lie group theory. We shall use these to show that any K- 
multiplier for G is similar to one which is globally analytic on Gx G (at 
least when G is simply connected), and that the elements of the multiplier 
group are in natural one-one correspondence with equivalence classes of 
central extensions of the Lie algebra of K by the Lie algebra of G. For 
general information and terminology involving Lie groups and Lie algebras 
we refer the reader to the books of Chevalley [1], [2], [3] and Jacobson [2]. 

Throughout this chapter K will denote a connected abelian Lie group 
and G a connected Lie group. The symbol T” will denote a torus of dimen- 
sion r and the symbol V° will denote a real vector space of dimension s. 
Both are regarded as Lie groups in the natural sense. 

Our first aim is to introduce an analytic structure on H,,. The essential 
step in this is accomplished by the following lemma. 


Lemma 7.19. Let K be an additive vector group and m a K-multiplier 
for G. Then there exists a K multiplier m’ similar to m such that m’ is C” 
on GxG. 


Proof. Note that m satisfies the identities (14). It is enough to prove 
this when K is the additive group of real numbers, since the general 
case will follow by applying the special case to each component. By virtue 
of corollary 7.15 we may assume that m is bounded on each compact 
subset of Gx G. Let u be a left Haar measure and p, a right Haar measure 
for G. 

Let then K = R+. Choose a C” function with compact support, say f, 
which is such that 


(49) f foda) = 1. 
Let a be the function x — a(x), where 
(50) a(z) = | mlzflu)dyatu). 


Clearly, a is bounded on each compact set. Let 
(51) my(x,y) = m(x,y) + a(axy) — a(x) —ay). 


Then m, is a #}-multiplier similar to m and is bounded on compact 
subsets of Gx G. Moreover, 


m(x,y) = f im(z,y) +m(æy,u)—m(y,u) — mlew} (u)dplu) 


= f fam (2, yu) — ma(ceyu)},f (ujdu{u), 
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using the multiplier identities for m. Thus 


6) miey) =| manfiy udu- | mæn fudulu) 
The uniform convergence of the integrals on the right of (52) shows that 


for each x e G, y > m,(z,y) is a C” function. Define a, to be the Borel 
function y —> a,(y), where 


(53) a,(y) = f m, (1, 9) f (udp (u) 


f, being a C” function with compact support and satisfying 


(54) Í filu)dp,(u) = 
G 
Let 
(55) mig(2e,y) = m, (x,y) +44 (zy) — a(x) —a,(y). 


Equation (55) shows that m, is a R!-multiplier similar to m, and bounded 
on compact subsets of G x G. Moreover, a brief calculation shows that 


(56) = _m2(z,y) = f m,(u,y) fi (ux *)dy,(u) -Í m,(u,y)fi(u)dp,(u). 


From (56) it follows that m, is a C” function of each of its variables when 
the other is fixed. Finally, define a, by 


(57) alx) = f M(x,u)f(ujdu(u). 
Write 
(58) m'(xz,y) = Mma(x,y) +as(xy)—as(x)—as(y). 


Then m’ is a #&!-multiplier similar to m and 
69  m'(zy) = | mazufiy udu- | malyu)f udatu) 


Since x, y > ma(x,u)f (ytu) is C” on Gx G for each u, (59) shows that m’ 
is C” on Gx G. The lemma is proved. 

We shall now examine the case when K is not simply connected. We 
cannot hope for global results but the local version of lemma 7.19 can be 
proved by the same technique. 


Lemma 7.20. Let K be a connected abelian Lie group and m, any K- 
multiplier. Then there exists a K-multiplier mọ’ which is similar to mo and 
which is C° in an open neighborhood of (e,e). 
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Proof. Since lemma 7.19 takes care of the case K= R?, it is enough to 
prove the present result when K is a torus and thus when K =T. We may 
(and do) therefore assume that K =T. By corollary 7.6 we may assume 
that mo is locally continuous. It is obvious that we can choose a real 
bounded Borel function m on G xG such that 


(60) Mo(x,y) = exp{im(x,y)} 


for all x, y e G, and such that m satisfies the identities characteristic of an 
R}-multiplier (cf. (14)) locally, i.e., for some symmetric open set M with 
compact closure containing e, 


(61) m(x,e) = m(e,x) = 0 
for all x in M, and 
(62) m(xy,z)+m(x,y) = m(x,yz)+m(y,2) 


for all x, y, ze M. 

We now proceed exactly as we did in the previous lemma. We choose f 
to be C”, and satisfying (49), but with the extra requirement now that f 
be zero outside a compact subset of M. a is defined as in (50) and m, as 
in (51). Since exp(#m) =m, is a T-multiplier, (51) shows that exp(zm,) is a 
T-multiplier similar to mọ. m, is bounded on Gx G and satisfies (61) for 
x e M and (62) for x, y, z e M. Equation (52) is now valid for all x, y e M, 
from which we infer that for each xe M, y > m,(x,y) is O” in M. We 
define a, and m, as in (53) and (55), respectively, but after choosing fı 
such that f, also vanishes outside M. Once again, (56) is valid for all 
x,y e M. Proceeding as we did before, we infer finally that mg is C” on 
MxM. This shows that exp(tm3)=m’ is C” on MxM, and is a T- 
multiplier similar to mọ. The lemma is proved. 


We are now in a position to introduce a differentiable structure on H m. 
This will then enable us to treat H,, as a Lie group. From this, the intro- 
duction of analytic coordinates on H,, will follow in the usual fashion. 


Theorem 7.21. Let K be a connected abelian Lie group and G a connected 
Lie group. Let m be any K-multiplier for G. Then there exists a unique 
analytic structure for H,, (compatible with the Weil topology) which converts 
H,, into an analytic group. The maps k — (e,k), of K into Hm, and (g,k) —> g, 
of H,, onto G, are analytic. 


Proof. H,, is connected by corollary 7.11. We know that any topological 
group admits at most one compatible analytic structure which converts it 
into a Lie group (cf. Chevalley [1], pp. 127-129). Thus we are concerned 
only with existence. Since we may pass to A-multipliers similar to m, we 
may assume, after lemma 7.20, that m itself is C® around (e,e). From 
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corollary 7.10 we know that for some open set M containing e, the Weil 
topology on M x K coincides with the product topology. We choose M so 
small that this condition is satisfied and that m is C° on M x M. 

We give to M x K the product of the C® structures on M and K and 
choose a symmetric open set M, containing e such that M,M,cM. For 
(z,k) and (y,k’)eM,xK, 


y-t = (ayrip Med), 
(kyk) = (ay 4b? E 

which shows that the map (x,k), (y,k’) —> (z,k)(y,k’)~1 is C° on (M, x K) x 
(Mx K). By a standard result in Lie group theory (cf. Chevalley [1], 
pp. 134-135) there exists a C” structure on Hm converting H,, into a Lie 
group, and such that the induced C® structure on M x K coincides with 
the product C” structure around (e,1). In this manner one converts Hm 
into a connected Lie group. Note, under the assumption that m is C°” 
around (e,e), that there exist open sets M€ GQ and K S&K containing the 
respective identities, such that M, x K, has the product C® structure. 

Now, it is known (cf. Pontrjagin [1]) that every C” Lie group can 
be given a unique compatible analytic structure. H,, may be given this 
unique analytic structure. The final statement of the theorem follows 
from the known fact (cf. Chevalley [1], pp. 128-129) that a continuous 
homomorphism of one analytic group into another is necessarily analytic. 
The proof of the theorem is complete. 


We can deduce several corollaries from this theorem. 


Corollary 7.22. The map x->(x,1) of G into Hm is C® around e, m being 
locally C9. 

This follows from the fact that the C® structure of H,, is, around (e,1), 
the product structure. 


Corollary 7.23. Let m be an arbitrary K-multiplier for G. Then there 
exists a multiplier m’ which is similar to m and is C” (respectively analytic) 
on all of GxG if and only if there exists a C?” (respectively analytic) map 
x —> c(x) of G into Hm such that j9(c(x))=2 for all x, i.e., if and only if 
there exists a global C” (respectively analytic) section. 


Proof. The proof is the same as for theorem 7.12; we need only replace 
“continuous” by “C®” and “analytic” at the appropriate places. 


Corollary 7.24. Given any K-multiplier for G, there exists a multiplier 
similar to it which is analytic around (e,e). 


Proof. This requires the result that, for any analytic group, the coset 
space determined by a closed subgroup admits a local analytic section 
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(cf. Chevalley [1], pp. 109-111). Applying this result to the analytic group 
H m and the closed subgroup {e} x K =i [K], we find a Borel section c, 


c : x — (x,a(x)) 


such that for some open set N around e, ¢ is an analytic map on N - N. If we 
define 


q(x,y) = c(xy)c(y)~*e(x)~?, 


then q(x, y) = (e,m,(x,y)~*) (cf. (46)) and q is an analytic map of N x N into 
H m. Since k — (e,k) is an analytic isomorphism and since {e} x K is a closed 
analytic submanifold of Hm, we deduce that m, is analytic on NxN. 
Obviously m, œm. 

This is as far as we can go with elementary methods. To obtain further, 
and in particular, global results, greater use must be made of Lie group 
theory. Our aim is to use some of the ideas of Hochschild [1], which in turn 
are based on a very interesting lemma of Malčev [1], to prove that when G 
is connected and simply connected, there exists a canonical one-one 
correspondence between the Lie group extensions and the associated Lie 
algebra extensions. We shall moreover be able to show that analytic 
multipliers exist in every similarity class and that they are naturally 
induced by analytic K*-multipliers for G, K* being a covering group of K. 

We begin with some preliminary remarks. For any analytic homo- 
morphism a of a Lie group A into a Lie group B, we write a for the 
induced homomorphism of the Lie algebras; a(exp X)=exp a(X) for 
X Ea, where a is the Lie algebra of A. Let b be the Lie algebra of B and 
let us assume that A and B are both connected and simply connected. 
Consider now a homomorphism d(X —> d(X)) of the Lie algebra b into the 
Lie algebra of all endomorphisms of the vector space a such that d(X) is a 
derivation of a for all X, i.e., d(X)({ Y,Z]) =[(d(X)( VY), Z]+[Y,d(X)(Z)] for 
all Y,Z in a. Then there exists a unique analytic homomorphism 
b —> 8(b) of B into the Lie group of invertible linear transformations of the 
vector space a such that d(exp X)=exp d(X) for all X e b. Since d(X) is 
a derivation of a, d(exp X) is an automorphism of the Lie algebra a. 
Hence 6(6) is an automorphism of a for all b. The map b, Y — 6(b)Y is 
clearly analytic from Bx a to a (a being a finite dimensional real vector 
space, can be considered in an obvious fashion as a real analytic manifold). 
Since A is simply connected, each 6(b) gives rise to a unique automorphism 
D(b) of A such that D(b)(exp Y)=exp{d(d)(Y)} for all Yea. Since 
b —> 8(b) is a homomorphism, b —> D(b) is a homomorphism of B into the 
group of all analytic automorphisms of the Lie group A. We thus see that 
A becomes a B-space. Moreover, the equation D(b)(exp Y)=exp{6(b) Y} 
shows that the map b,a —> D(b)a is an analytic map of Bx A into A. 
Therefore if we form the semidirect product Bx) A (cf. Chapter VJ), 
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BxpA becomes a connected, simply connected Lie group. Its analytic 
structure is just the product structure of B x A. As usual, we shall identify 
A and B with subgroups of Bx,A; A will be a closed normal subgroup, 
and for be B, ae A, bab~} will coincide with D(b)a. Moreover, for 
Yea and X eb, exp X exp Y exp(— X) =exp{exp(d(X)) Y}. This shows 
that, if we identify the Lie algebra of B xpA with the direct sum b+a, 
then for Yeaand X e b, [X,YJ=d(X)Y. 


Lemma 7.25. Let g be a Lie algebra over a field of characteristic zero and 
gı an ideal in g. Suppose that either dim(g/g,) =1 or g/g, ts semisimple. Then 
gı ts a semidirect summand of g, i.e., there exists a Lie subalgebra p such 
that g=9,+ and g, O p=0. 


Proof. If dim(g/g,)=1, then we may take for p any one-dimensional 
subspace not in g,. In case g/g, is semisimple, the assertion is a refinement 
of the well known Levi-Maléev theorem and follows easily from it. Let 
r be the radical of g. Let m be the canonical map of g on g/gı. Since a[r] 
is a solvable ideal of g/g,, 7[v]=0, so that rcag,. Let p’ be a semisimple 
subalgebra of g such that p’ N r=0 and p’+1r=g. Such a p’ exists, by 
virtue of the Levi-Maléev theorem (cf. Chevalley [3], pp. 185-145). As 
p’ A g is an ideal in p’, there exists a subalgebra p (which is even an ideal 
in p’) such that p+(p’ N g,)=p’ and pn (p’ A g,)=0 (ef. Chevalley [3], 
pp. 64-65). p is semisimple and it is trivial to verify that p AO g,=0 and 
P+O1=6. 


Lemma 7.26 (Maléev [1]). Let H be a connected and simply connected 
Lie group and K a closed connected normal subgroup of H. Let y be the 
canonical homomorphism of H onto H|K. Then there exists an analytic map 
cof H/K into H such that 


(63) y(c(t)) = t 
for all te HJK. 


Proof. We follow Hochschild’s proof [1]. Let § be the Lie algebra of H 
and f the Lie subalgebra which corresponds to K. Since K is normal, f is 
an ideal in ). Let h=§,2>).2>---25,=£ be a composition series of ý 
over f, i.e., each ),,, is an ideal in h,, and is maximal in h; We prove 
lemma 7.26 by induction on 7, the length of the composition series. 

Since h, is maximal in ,, either dim(h,/,.)=1 or §,/, is semisimple. 
Therefore, by lemma 7.25, there exists a subalgebra b of h, such that 
H), =b + h, and b N h,=0. For X eb and Y e bp, write 


(64) d(X)Y = [X,Y]. 


Let H, and B be connected simply connected groups corresponding to hz 
and b, respectively. In view of our earlier remarks, the map X —> d(X) 
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gives rise to an analytic map b, a —> D(b)a of B x H, into H, such that H, 
becomes a B-space and for each b e B, D(b) is an analytic automorphism 
of H,. We may therefore form the group B xpH, and identify its Lie 
algebra with h, in such a way that B x {e,,,} and {ep} x H, correspond to b 
and hz, respectively (ex is the identity of B and ey, that of H). Now, 
B xpH, is simply connected and so is H. Hence we may identify H with 
B x,H,. In other words, we may assume that H, and B are the analytic 
subgroups of H corresponding to h, and b, respectively. Then H, O B= 
{ey} (the identity of H) and H = H,B. 

Let H=H/K and let y be the canonical homomorphism of H onto Ñ. 
Put H,=»)(H,),B=y(B). Now, for each b e B, D(b) leaves K and H, invari- 
ant and so induces an automorphism D(b) of H,/K. Clearly, b, a—> D(b)a is 
analytic from Bx (H,/K) into H/K and so one can form Bx,(H.,/K). It 
is obvious that our identification of H with BxpH,, gives rise to a 
natural identification of H with B xp(Ha/K). This proves that H, and B 
are closed in H, that H, N B={eg} and H=d.B. In particular, y is an 
isomorphism of B onto B. By the induction hypothesis, there is an analytic 
map c’ (8 —> c’(3)) of H, into H, such that y(c’(s)) =5 for all 5 e Hy. Define 
con H by 


(65) c(3y(b)) = c'(s)b = (8 Ha, be B). 

Then c satisfies (63). 
Corollary 7.27. The map 

(66) k, t — c(t)k 

is an analytic isomorphism of the manifold K x (H/K) onto the manifold H. 
Proof. Let ¢ denote this map. ¢ is analytic and one-one. Fora e H, 

t-1a) = (el(y(a))~ta,y(a)) 
which is clearly analytic. 


Corollary 7.28. Let h be a Lie algebra over the reals, H a connected and 
simply connected Lie group corresponding to h. Let t be an ideal in h. Then 
the analytic subgroup K of H which corresponds to t is closed. Moreover both 
K and HJK are simply connected. 


Proof. That K is closed is well known (cf. Chevalley [1], pp. 125-127). 
Corollary 7.27 implies that H is (topologically) homeomorphic to K x H/K, 
and so the simple connectivity of H implies that of K and H/K. 


Remark. It is known that if K and H/K are simply connected, so is H 
(cf. Chevalley [1], pp. 59-60). Thus corollary 7.28, together with this result, 
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completely solves the problem of simple connectivity for H, K, and 
H/K, when K is closed, connected, and normal. 


Lemma 7.29 (Hochschild [1]). Let G be a connected, simply connected Lie 
group and (f,1,7) an extension of the connected Lie group K by G. Then 
there exists an analytic map c of G into H such that 


(67) He(g)) = 9 
forallgeG. 


Proof. Let H* be a connected and simply connected covering group of 
H and let ô be the covering homomorphism of H* onto H. Then jo Sis a 
homomorphism of H* onto G. Let K, be the kernel of j o 8 and K,° the 
component of the identity of K,. If K,°#K,, H*/K,° would provide a 
nontrivial covering for G', which is not possible since G is simply connected. 
Hence K,=K,°,i.e., K, itself is connected. From Maléev’s lemma we know 
that there exists an analytic map c* of G into H* such that 


(je d)(c*(9)) = g 


for all g e G. If we write c=5 o c*, then c is analytic from G into H and 
satisfies (67) for all g e G. 


Corollary 7.30. Let G be a connected, simply connected Lie group and K 
a connected abelian Lie group. If m is a K-multiplier for G, then there exists 
a K-multiplier m’ similar to m which is analytic on Gx G. 


Proof. Let H,, be the extension associated with m. From lemma 10.29 
we know that there exists a global analytic section from G into H,,. Then, 
corollary 7.23 implies that there is a multiplier similar to m which is globally 
analytic over Gx G. 


Remark. That m’ can be chosen to be C” was proved by Bargmann [1]. 


We shall use the lemmas of Hochschild and Maléev to obtain a direct 
relation between central extensions of Lie groups and the corresponding 
central extensions of Lie algebras. Let f, g be Lie algebras over R with t 
abelian. A central extension of t by g is a triple (h,«,8), where is a Lie 
algebra over R, « is an isomorphism of t into the center of h, and £ a 
homomorphism of ġ onto g such that e[t] = kernel (8), i.e., 


(68) [o[t],5] = 0 
and 


(69) 0 —>t—> j= g +0 
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is an exact sequence. Central extensions (,a,8) and (’,«’,B’) of t by g are 
said to be equivalent if there exists a (Lie) isomorphism £ of onto h’ such 
that the diagram 


oo N | A 
H 
commutes. Suppose K is a connected abelian Lie group, G a connected 


Lie group, and (H,i,j) a central extension of K by G. Let f, ġ, and g 
be the Lie algebras of K, H, and G, respectively. Then it is obvious that 


(ġ,t,j) is a central extension of t by g. We shall say that (ġ,i,j) is the 
associated central extension of t by g. A routine transport of structure shows 
that corresponding to equivalent central extensions of K by G, are 
associated equivalent central extensions of t by g. 


Theorem 7.31. Let G, K be connected Lie groups and g, f the correspond- 
ing Lie algebras. Suppose that K is abelian and G is simply connected. For 
any equivalence class + of central extensions of K by G, let + denote the 
equivalence class of the associated central extensions of t by g. Then + > 7 
is a one-one correspondence and its range is the set of all equivalence classes 
of central extensions of t by g. 


Proof. We shall first give consideration to the case when K is also 
simply connected. Let (),a,8) be any central extension of t by g. Let H 
be a connected and simply connected Lie group which corresponds to }. 
Since K is simply connected, there exists an analytic homomorphism 7 
of K into H such that 1=a; similarly there exists an analytic homomorph- 
ism j of H onto G such that j=f. The range i[K] being an analytic sub- 
group of H having the ideal a[f] for its Lie algebra, corollary 7.28 implies 
that [K] is closed and simply connected. Hence 2 is also an isomorphism. 
On the other hand, 2[K] is the connected component containing the 
identity of the kernel K, of j. If o[A]4AK,, H/tLK] will be a nontrivial 
covering of G2H/K,, which is impossible as G is simply connected. 
Therefore [K] is precisely the kernel of j. In other words, (H,i,j) is a 
central extension of K by G and (b,«,8) is its associated extension. The 
uniqueness of 2 and 9, once H is specified, implies at once that two exten- 
sions of K by G, constructed in this manner, are equivalent if and only if 
their associated Lie algebra extensions are equivalent. Thus we have 
proved the lemma in the special case when K is simply connected. 

Suppose now that A is connected but not simply connected. Let K* 
be a connected, simply connected Lie group covering K. Let e be the 
covering homomorphism. Let Z be the kernel of e. In view of the result for 
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the simply connected case, it is enough to set up a natural correspondence 
between central extensions of K and K* (by G) which respects equivalence. 

Let (H*,1*,7*) be a central extension of K* by G. Then H* is simply 
connected. Let H = H*/i*Z and let ô be the canonical homomorphism of 
H* onto H. Then 6 has discrete kernel. Since kernel (5)<i*[K*], there is a 
unique homomorphism j of H onto G such that j*=j o ô. Since kernel 
(5) =2* [kernel (e)] there is a unique homomorphism 7 of K into H such 
that 6 o1*=10 e; ù is then injective. It is then obvious that kernel (j) = 
i[K]. Thus the diagram 


(71) 


commutes. Moreover, (/,1,7) is a central extension of K by G. 

Suppose conversely that (H,i,/) is a central extension of K by G. Let 
H* be a connected, simply connected Lie group which covers H. Let ô 
be the covering homomorphism. Then there exists a unique homomorph- 
ism 1* of K* into H* such that ô o i* =i o e. Write j* =j o 8. As Gis simply 
connected, we conclude that kernel (j*) is connected. From this it follows 
quickly that kernel (j*) =7*[K*]. Hence 7*[K*] is simply connected. Since 
K* is also simply connected, 7* is injective. Hence (H*,7*,j*) is a central 
extension of K* by G. Moreover, the diagram (71) commutes, and 


(72) kernel (6) = i*[Z]. 


We have thus set up a natural correspondence between central ex- 
tensions of K and K* by G. Routine diagram chasing now shows that 
this correspondence respects equivalence. The proof of theorem 7.31 is 
complete. 


Let (H*,1*,7*) be a central extension of K* by G and let (H,1,7) be a 
central extension of K by G. Suppose there is a homomorphism ô of H* 
onto H such that (71) is a commutative diagram. Then we say that 
(H*.1*,7*) covers (through e) (H,i,7). 


Corollary 7.32. Let G be connected and simply connected. Let K, K* 
and £ be as above. Then the map m* —> c o m* maps the set of all analytic 
(respectively C”) K*-multipliers onto the set of all analytic (respectively 0”) 
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K-multipliers. m,* ~ mo* if and only if e€ o m,* ~e o m,* so that m* —> c o m* 
induces an isomorphism of M,x.(G) onto M,(G). 


Proof. Let m be an analytic K-multiplier for G. Let H,, be the associated 
extension. Then H,, has the product analytic structure. Since we can cover 
the extension H,, by an extension of K* by G, we may assume that for 
some K*-multiplier m,*, the extension H m,. covers Hm. In view of corollary 
7.30, we may assume that m,* is analytic on GxG so that Hm,» has the 
product analytic structure. 

Let S={(g,1):geEG}CH,,. Let 5 be the covering homomorphism of 
Hm,» onto H,,. Write S*=6~*(S). A standard topological argument shows 
that ô is a covering map of each connected component of S* onto S. Let 
So be the component of S* containing the identity (e,1) of H,,,.. Then So is a 
closed regular analytic submanifold of H,,,. Since S is simply connected, 
ô is an analytic isomorphism of S, onto S,. Hence there is an analytic map 
a of G into K* such that for all g € G, 


(73) 6((9,a(9))) = (9,1). 
Since 6 is a homomorphism, (73) gives 
(74) 8(xy,m,*(x,y)a(x)a(y)) = 8((xy,a(xy)))(e,m(x,y)). 


Now, (xy,a(xy)) ~*(ay,a(x)a(y)m,*(x,y)) is computed to be 


(e a mre) 
so that (74) leads to the equation 
(e, ZAH m,*(e,y)) = (em(z,y)), 
and consequently, 
(75) (ee mi*(z,y)) = mizy) 


for all x, y e Œ. If we write m*(x,y) for the element in (75) to which e is 
applied, we have 


(76) m = eom* 


m* is clearly analytic. This proves the result in the analytic case. The C” 
case can be disposed off similarly. 

The equation eom*=m implies, on the other hand, that the map 
x, k* —> x, e(k*) is a homomorphism of Hm. on H,, which sets up a covering 
of the extension Hm by Hm. From theorem 7.31 we now conclude im- 
mediately that m,*~m,* if and only if e°m,*~eom,*. From this it 
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follows at once that m* —> e o m* induces an isomorphism of M,.(G) onto 
Mx(G). 


Corollary 7.33. Let G be a connected, simply connected Ine group and K 
a connected abelian Ine group. If m is a K-multuplier, then m is exact if 
and only if, in the extension of t by g which is associated with H n, the image of 
t is a semidirect summand. | 


Proof. Let (§,«,8) be the extension of f by g which is associated with 
(H nstorJo) (where 19(k)=(e,k) and jolg,k)=g). If m is exact, there exists a 
Borel homomorphism c(z — c(x)) of G into Hm such that j)(c(x)) =z for all 
xe G (theorem 7.12). Since G and H,, are Lie groups, c is analytic. Hence 
the range B of c is an analytic subgroup of H,,. If b is the subalgebra of $ 
corresponding to B, it is obvious that b A a[f]=0 and b +a[f]= 9. 

Conversely, suppose there exists a subalgebra b of such that 
b+alf]=h and b N o[f]=0. Then there exists obviously an isomorphism 
y of g into f such that y maps g onto b and £ o y is the identity. Since G is 
simply connected, there is a homomorphism c of G into H,, such that 
é=y. Since Boy is the identity, 7) °c is locally the identity. Since G is 
connected, jọ o c is the identity everywhere. This proves that m is exact. 

The only problem which is still to be examined is that of classifying 
the central extensions of f by g. This can be done very simply as the 
problem is linear. We now proceed to indicate the standard solution to 
this problem (cf. Chevalley-Eilenberg [1]). Let p be a bilinear map of 
g x g into f such that (i) p is skew symmetric, i.e., 


(77) P(X, Y)+p(V,X) = 0 
for all X, Y €e g, and (ii) p satisfies the identity 
(78) P(X, Y],Z)+p([Y,Z],X)+p([Z,X],Y) = 0 


for all X, Y, Z e g. We shall associate with p a central extension of t by 
g. Let 


hp = g xf, 
and let us define, for (X,S) and (X’,S’) € bp, 
(79) [(X,8),(X',8")] = ([X,X'],p(X,X’)). 


From the identities (77) and (78) it follows that ), becomes a Lie algebra 
under the bracket operation (79). Moreover, (79) shows trivially that 
(X,S) — X is a homomorphism of §, onto g, that (0,8) lies in the center 
of h, for all Sef, and that S — (0,8) is an isomorphism of f into b,. 
Write 
a(S) = (0,5), 
Bo(X,S8) = X. 


(f»,%,8o) is then a central extension of t by g. 


(80) 


272 GEOMETRY OF QUANTUM THEORY 


Lemma 7.34. Every central extension of { by g is equivalent to an extension 
bh, for a suitable bilinear map p of g x g into t satisfying (77) and (78). h,, and 
bp, are equivalent extensions if and only if there exists a linear map q of g 
into Í such that 


(81) PAX, ¥)—p,(X,¥) = g([X, Y]) 
for all X, Y Eg. 


Proof. Let (,a,8) be a central extension of t by g. By identifying t 
with a[f] and g with a linear manifold supplementary to aft], we may 
assume that h=gxf, «=«o, and B=ß,. We have now only to determine 
the form of [.,.] for pairs of elements of 4. Since (X,S8),(X’,S’) > 
[(X,S),(X’,S’)] is bilinear and skew symmetric and since (X,S)—> X is a 
homomorphism of §) onto g, it follows that there exists a skew symmetric 
bilinear map of § x into f, say r, such that 


(82) [(X,8),(X°,8")] = ([X,X"],7(X,8; X'S) 


for all X, X’ eg and S, S'et. The fact that (0,8) lies in the center of 
for all S in f now implies that 


(83) r(0,8; X'S) = 0 


for all X’ € g and S, S’ e t. The bilinear maps of h x into t which are skew 
symmetric and satisfy (83) depend only on X and X’. Thus there exists a 
skew symmetric bilinear map p of g x g into t such that r(X,S; X’,S’)= 
p(x,X’) for all X, X’ eg and S, S’ e t. We have then 


(84) [(X,8),(X",8")] = ([X,X"],p(X,X’)). 


If we now use the fact that [.,.] satisfies the Jacobi identities, we see that 
_ (78) is satisfied by p. In other words, the extension f is equivalent to },. 

Suppose now that },, and §,, are two extensions of t by g corresponding 
to the bilinear maps p, and pg. It is easily seen that for any linear map q 
of g into t, the map 


(85) Ly : (XS) — (X,q(X) +8) 


is a linear map of §,, into §,,, sends (0,S) to (0,8), and is such that the 
diagram 


Do, 1", Noa 


i \ 


is commutative; and that every linear map L of ,, into §,,, which sends 
(0,8) to (0,8), and which is such that (86) is commutative (with L 
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instead of L,), is of the form L, for suitable g. It is then clear that L, is a 
linear isomorphism of §,, onto ,,. A trivial calculation shows that L; is a 
homomorphism of the Lie algebras hp, and §,, if and only if p,, p2 and q 
are related as in (81). This completes the proof of the lemma. 


We shall call a skew symmetric bilinear map p of g x g into t closed if 
it satisfies (78). We shall say that p is exact if there exists a linear map q of 
g into f such that 


(87) P(X, Y) = q([X,Y]) (X, Veg). 


The vector space of all closed skew symmetric bilinear maps of g x g into 
f is denoted by Mr (g), and the subspace of exact elements is denoted by 
G(g). We write 


(88) Mr(g) = Me'(g)/G(g). 


Corollary 7.35. For a pe Me (g), p is exact if and only if {0} xf is a semi- 
direct summand of }». 


Proof. If p is defined by (87) for some linear map q of g into f£, it is 
immediate that the range b of the map X —> (X,q(X)) is a subalgebra of 
h, such that «,[f] O 6=0 and a,{t]+b=9,. Conversely, let b be such a 
Lie subalgebra. Then for each X eg, let us define the element q(X) et 
by the condition that (X,q(X)) €b. q is evidently a linear map of g into 
t, and the condition [(X,q(X)),(¥,¢(Y))]=([X, Y],q({X, Y])) leads to (87). 

The theory developed so far has established a canonical isomorphism 
between the groups M,(G) and M,.(G@), and also a canonical one-one 
correspondence between M,.(G) and Mig) (G@ simply connected). Now 
Wtr(g) is a vector space. On the other hand, if we write K* additively, it is 
isomorphic to a vector group, so that M x.(G) also becomes a vector space, 
with addition as the group operation in it. It is indeed most natural to 
expect that the canonical correspondence between the elements of 
M ,.(G) and M(g) is actually a linear isomorphism. We shall now prove 
that this is indeed so. We assume K* = R”. The R*-multipliers for G then 
satisfy (14) and form a vector space over R, and the exact ones form a 
subspace of it. M,»(G) is thus a vector space. 


Lemma 7.36. Let G be connected and simply connected. For any analytic 
R"-multiplier m for G, let [m] be the similarity class of m. Let bim be the 
element of Mpr(g) which corresponds to the equivalence class of central 
extensions of R” by G which are associated with [m]. Then 


(89) [m] ~> bim) 


is a linear isomorphism of M p(Q) onto Mpr(q). 
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Proof. Let m be an analytic R*-multiplier for G and let H,, be the 
associated group extension. For (x,k) and (y,k’)¢ Hmn (=Gx,,R" in our 
earlier notation), 


(x,k)(y,k’) = (xy,k+ k’ +m(z, y)). 


Now (e,0) is the identity of H,,, and H,, has the product analytic structure. 
We shall now determine the Lie algebra h of H,,. Following Chevalley [1] 
we regard elements of h as left invariant vector fields on H,,; for each 
(x,k) € Hm, we identify the tangent space to H,, canonically with G, x R", 
G, being the tangent space to G at x. Of course we identify the tangent 
space to #" at any of its points with R” itself in the usual fashion. 

Let X € g be the left invariant vector field x —> X, on G and let u e R". 
We write (X,u) for the left invariant vector field of H,, which assigns to 
(e,0) the tangent vector (X,,w). Since left translation by (2z;k) is (y,k’) > 
(xy, k+k'+m/(z,y)), we see that the tangent vector assigned by (X,u) at 
(z,k) is given by the expression 


X put (5 m(x, exp tă )) 


Write now for any C® function f on GxG and Y,,---, Y,, Zitte, BEQ, 
yYy,zeđ, 


Yoe Y,: 2; 2,:+: Z 
(90) f(y; Yı 1 ) 
= (O° + Sot; . -0t ðu Dus )i, = 


t=0 


soot =u "=u, =O? 


where 
plti, bey Urs, Us) 
= f(yexpt,Y,---exp#t,Y,,zexp u,Z,---exp u,Z,). 
Then (X,u) is given by 
(91) (X,u): x, k— (X, u+m(z : e; X)). 
From (91) we find 
(92) [(X,w),(¥,v)] : x, & —> ([X,Y],, m(x; X: e; Y)—m(zx; Y :e; X)). 
From the identities (14) we obtain, on differentiation, 
m(x; X :e; Y) = m(x :e; XY)+m(e; X :e; Y). 

This then leads to 
(93) [(X,u) (Y w)] : x, k —> 

(X, VY], m(x : e; [X,Y])+m(e; X : e; Y)—m(e; Y : e; X)) 
so that finally, on comparing (93) with (91), we obtain 
(94) = [(X,u),(¥,v)] = (X, Y], m(e; X : e; Y)—m(e; Y :e; X)). 
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In other words, if we write 
(95) Bal(X, Y) = m(e; X :e; Y)—m/(e; Y : e; X) 


then the Lie algebra h of H,, can be identified with g x R” in a canonical 
fashion and 


(96) [(X,u),(¥,v)] = (X, ¥],B,(X, ¥)). 


B,, 18 evidently a skew symmetric bilinear map of gxg into R”. The 
Jacobi identity applied to (96) leads as before to the fact that B,, is a 
closed form. 


We know already that [m] —> bim is a one-one correspondence of 
M px(G@) onto Mpr(g), and (96) shows that bim is the element of Mtp»(q) 
defined by Bm. But a look at (95) shows that B,(X,Y) depends linearly 
on m, for fixed X, Y e g. Consequently, [m] —> bim is a linear isomorphism 
of M,(G) onto Mp(g). This proves the lemma. 


Collecting all these results together we obtain the following theorem. 
We write G for the Lie group we have been concerned with. K is a connected 
abelian Lie group, A* a connected, simply connected covering group of 
K, and e is a covering homomorphism of K* onto K. 


Theorem 7.37. Let G be a connected and simply connected Lie group, 
and let K, K*, and e be defined as above. Then any K-multiplter for G is 
similar to one which is analytic on Gx G. For any analytic (respectively C”) 
K*-multiplier m*, e o m* is an analytic (respectively C®) K-multiplier for 
G, and every analytic (respectively C”) K-multiplier can be represented in 
this form; m,* and m,* are similar if and only if eo m,* and eo m,* are 
similar. If we associate with any equivalence class of central extensions of K 
by G the equivalence class of associated extensions of t by g, we obtain a 
one-one correspondence whose range is the set of all equivalence classes of 
central extensions of t by g. These latter classes are in canonical one-one 
correspondence with elements of the space Wilg). If we write K* additively 
and treat it as a real vector space, M,.(G) becomes a real vector space and the 
induced correspondence of Mg- (G) with M(g) is a linear isomorphism. 
Finally, a K-multiplier G is exact if and only tf, in the associated extension 
of t by g, the image of t is a semidirect summand. 


5. EXAMPLES 


We shall now discuss a number of examples which illuminate various 
aspects of the theory described in Sections 1 through 4. We restrict our- 
selves to the case of (ordinary) multipliers and #’-multipliers. 
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Covering Groups: There is no direct connection between simple con- 
nectedness and exactness. We shall give, further on, examples of connected 
but not simply connected Lie groups admitting only exact multipliers, 
and of simply connected Lie groups whose multiplier groups are not 
trivial. There are, however, some elementary relations between multipliers 
and representations of a group and its covering group that may be 
worthwhile putting down. 

Let G be a connected Lie group, G* a connected and simply connected 
covering group of G, and ô the covering homomorphism. Let x* —> V, 
be an irreducible unitary representation of G* in a Hilbert space # By 
Schur’s lemma, V maps every element in the center of G* into a scalar 
multiple of the identity. Since the kernel Z of ô is contained in the center 
of G*, it follows at once that V,.,.=y(z*)V,. for all z* € Z and 2* e G*, 
x being a character of Z. This implies that for any g e G, all the operators 
V, with 6(g*)=g, define the same element of the projective group of #. 
In this way, each irreducible representation of G* induces a projective 
representation of G. 

The relation between G and G* is also useful in another way. Let m be a 
multiplier for G and let m* be the multiplier x*, y* —> m(8(x*),8(y*)). It 
is then easily seen that m — m* induces a homomorphism, say h, of (G) 
into M(G*). Since G* is simply connected, theorem 7.37 can be applied 
to the study of M(G*). In this way useful global insights into the structure 
of M(G) may be obtained. However, it must be remembered that h is in 
general neither injective nor surjective, so that, in this approach to the 
analysis of M (G) in terms of M(G*), the kernel and the range of h must 
both be determined. For example, if G is a torus, M(G) is trivial while 
M(G*) is nontrivial as soon as dim G*>1; if G is semisimple, M(G*) is 
trivial, but M(G) will not be (cf. supra). 


G=R:*, a vector group in s-dimensions. In this case we can explicitly 
describe the multiplier group (cf. also Weyl [1], pp. 273-274). We formulate 
our conclusions in the form of the following theorem. It is an easy 
consequence of the results described in theorem 7.37. 


Theorem 7.38. For any real skew symmetric bilinear form p on R" x R’, 
(97) My : £, Y —> exp ip(x, y) 


is a multiplier for R*. The mapping which assigns to p, the similarity class 
of my, 18 an isomorphism of the additive group of all skew symmetric bilinear 
forms on R! x R! onto the multiplier group M(R°). In particular, all multi- 
pliers of R! are exact. 


Proof. It is trivial to check that m, is a multiplier for any bilinear 
form p on R'xR’ (skew symmetric or not) and that M, +p, =Mp Mp, 
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To prove the remainder of the theorem we pass to the covering group R! 
of T and use theorem 7.37. Now, for any bilinear form p,2,y—> p(z,y) 
is an R!-multiplier for R* and it is exact if and only if it is symmetric 
(corollary 7.14). In particular, p,~/p, if and only if p, — P; is symmetric. 
If p, and p; are skew symmetric, it is now obvious that p, ~ po, if and only 
if p, =p2. If p is any bilinear form, we can always write it as p, +c, where 
pı is skew symmetric and c is symmetric; then p~p. These conclusions 
then carry over to the 7'-multipliers m,. 

In order to complete the proof we go back to (95). We write B,=B,,, 
and prove that the mapping p —> B, has the set of all skew symmetric 
bilinear maps of R’ x R’ into R! for its range (we identify g with R” also 
in the obvious fashion so that exp is the identity map). This is trivial, for, 
by (95), 


~iB,(X,Y)=2p(X,Y) (X, YeR’), 


whenever p is a skew symmetric bilinear form on R’xR’. The last 
assertion follows trivially from the fact that 0 is the only skew symmetric 
bilinear form on R! x R}, 


Connected Semisimple Lie Groups: Suppose g is a semisimple Lie algebra 
over the reals and (§,«,8) a central extension by it of the abelian Lie 
algebra t. Since §/a[f] is semisimple, it follows at once that o[f] is the radical 
of h. By the Levy-Maléev theorem we conclude that there exists a sub- 
algebra b of ) such that o[f]+6= and e[t] ^ b=0. This observation also 
leads at once to the conclusion (cf. theorem 7.37) that if G is a connected 
and simply connected semisimple Lie group, then every multiplier for G is 
exact. Since the classical groups are semisimple, this result applies to the 
universal covering groups of these groups. For example, we may take Go 
to be the connected component of the group of all invertible linear 
transformations of an s-dimensional vector space leaving invariant a 
nonsingular symmetric bilinear form. When s>2, Gy is known to be a 
semisimple Lie group. If G denotes a universal covering group of Go, then 
all multipliers for G are exact. The examples of greatest interest from the 
physical point of view arise when s=3 and the bilinear form has a positive 
definite quadratic form, and when s=4 and the bilinear form has a 
Minkowskian quadratic form. In other words, the universal covering 
groups of the rotation and Lorentz groups (connected) admit only exact 
multipliers. 


Inhomogeneous Groups: The symmetry groups of space time cannot be 
semisimple because they contain the space-time translations as a normal 
subgroup. Therefore, the results derived in the previous example are 
inadequate for physical applications and it is necessary to extend their 
scope. We shall give here one such extension. 
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Let G be a connected Lie group and let p(x — p(x)) be a (not necessarily 
unitary) representation of G in a finite dimensional real vector space V. The 
map x, v—> p(x)v is analytic from Gx V into V and converts V into a 
G'-space. The semidirect product 


(98) G,=Gx,V 


can then be formed. G, (in the product analytic structure) is a connected 
Lie group; if G is simply connected, it is a simply connected Lie group. 
We identify V and G with closed subgroups of G,. V is normal and for 
zeG, ve V, vs“ t=p(xw. We shall call G, the inhomogeneous group 
associated with G and p. 

Our purpose now is to prove a theorem from which it follows that 
under certain conditions the covering group of the inhomogeneous group 
admits only exact multipliers. We shall call the representation p of Gina 
real vector space V admissible if no nonzero skew symmetric bilinear form 
on V x V is invariant under p. 

The fact that many representations of a group are admissible is con- 
tained in the following lemma. Since we work with real vector spaces it 
has to be formulated in the context of absolutely irreducible representa- 
tions. A representation n(x — n(x)) of G in a real vector space W is said 
to be absolutely irreducible if the only endomorphisms of W which commute 
with all the r(x) are scalar multiples of the identity. If G is semisimple, 
absolute irreducibility implies irreducibility in the usual geometric sense 
and absolute irreducibility of 7 is equivalent to the irreducibility of 
a in the complexification of W (G semisimple). The classical Schur’s 
lemma can be formulated for absolutely irreducible representations as 
follows: if 7 and 7’ are absolutely irreducible representations of a semi- 
simple group G, acting in W and W’, then the vector space of linear 
maps L from W into W’ such that La(x)=7'(x)L for all x eG is of di- 
mension <1, the dimension being 1 if and only if m and 7’ are equivalent. 


Lemma 7.39. Let p be a representation of a connected semisimple Lie 
group G in V. Suppose that V is a direct sum of p-invariant subspaces 
Viste, V, such that for each i, (i) the subrepresentation p, (of p) defined by 
V, is absolutely irreducible and (ii) there exists a nonzero symmetric bilinear 
form on V,x V, which is invariant under p,. If the representations p1, post, 
ps are mutually inequivalent, then p is admissible. 


Proof. Let V* be the dual of V and for x e G and te V*, let p*(x)t be 
the element v —> t(p(x7 1w) of V*. Then p*(x)(t —> p*(zx)t) is easily verified 
to be an invertible endomorphism of V*. It is also easy to verify that 
p* : x — p*(x) is a representation of G in V*, the so-called contragredient 
representation acting in V*. Now any bilinear form B on V x V gives rise 
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to a linear transformation Ts of V into V*, and B is invariant if and only if 
T p intertwines p and p*, i.e., 


(99) T pp(x) = p*(x) Tg 


for all x e G. Let V,*< V* be the annihilator of >,,, V; Then p* leaves 
each V,* invariant. From the fact that p, leaves a nonzero bilinear 
form on V,x V, invariant, it follows that there exists a nonzero linear 
map T, mapping V, into V,* and V, into 0 for 747, such that T, inter- 
twines p and p*. From the standard form of Schur’s lemma we conclude, 
using the mutually inequivalent nature of p,,---, p;, that the operators 
T,,--+, T span the vector space of operators which intertwine p and p*. 
This implies that every operator which intertwines p and p* is a Tp for a 
symmetric bilinear form Bon V x V. Hence any bilinear form on V x V 
left invariant by p is necessarily symmetric. Consequently, p is admissible. 


Theorem 7.40. Let G be a connected semisimple Lie group and p an ad- 
missible representation of G in a real finite dimensional vector space. Then, 
for the universal covering group G,* of G,, every multiplier is exact. In 
particular, if a(x —> az) is a representation of G, into the group of auto- 
morphisms of the logic of a (complex separable) Hilbert space X, there exists a 
representation U(x* —> U,.) of G* in # such that for each x in G, a, ts 
induced by U, for any x* in G,* lying above x. If p does not contain the 
trivial representation as a subrepresentation, U is the only representation of 
G,* with this property. 


Proof. Let us first describe G,* in a specific manner. Let G* be a 
connected and simply connected covering group of G and let e(x* — x) 
be the covering homomorphism. Then poe=p* : 2*-> p(e(x*)) is a 
representation of G* in V. p* is obviously admissible and we form the 
group 
(100) G* = G* xV. 


It is clear that G,* is connected and simply connected and the map 
x*, v —> e(x*), v is the covering map of G,* on G, (we identify V and G* 
with subgroups of G,* as usual). If « is the given representation of G, in 
the group of automorphisms of the logic of # we know (cf. footnote in 
Section 1) that there is a projective representation V+ of G, in # which 
induces a. We lift V+ to the projective representation V : 2*, v —> Vii.) , 
of G,*. We now apply theorem 7.5. In order that a be induced by a 
representation of G,* in the manner described in the theorem, it is 
necessary and sufficient that there is a unitary representation of G,* 
which gives rise to the same representation as V in the projective group of 
H; clearly this will happen if and only if the multiplier of V is exact. 
Also, the representation of G,* which induces œ is unique up to 
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multiplication by a continuous homomorphism of G,* into the group T 
(corollary 7.7). Thus, in order to prove the assertions of the theorem, it is 
enough to prove two things: first, that all multipliers for G,* are exact, 
and second, that G,* admits no nontrivial continuous homomorphisms 
into T under the assumptions made on p. 

We take up the first point. Let g, v, and g, be the Lie algebras of G*, 
V, and G,*, respectively. v is an ideal in g,, g a subalgebra in g,, and 


vag = 0, 
v+g = 8p 
If $ is the representation of g in V that corresponds to the representation 


p* of G*, we shall, by using the canonical isomorphism of v on V, assume 
that p and p act also on v. Thus, for Z € v and X € 9, 


(101) [X,Z] = p(X)(Z); 


this corresponds to the relation 2*vx*~* = p(e(x*))u (x* e G*, ve V). 

In order to prove that all multipliers for G,* are exact, it is sufficient 
to prove that if t is any abelian Lie algebra and (§,p,q) a central extension 
of t by g,, then p[f]is a semidirect summand of . We shall now demonstrate 
this. We write 


(102) v = g~*(v) 
and 
(103) g° = q-*(Q). 


q maps gq’ onto g and the kernel of q is p[f]. Hence we conclude from 
lemma 7.25 that there exists a subalgebra a such that an p[t]=0 and 
a+ p[f]= g’. q is an isomorphism of a onto g, so that a is semisimple. Since 
b N g=0, vo’ ON acpi] so that v’ N a=0. In other words, 


vb’ Oa = 0, 


104 
(104) b’ta = h. 


Now v’ is an ideal in h, and so for any X €a, (ad X)(Z)=[X,Z] lies in 
v’ whenever Z € v’. Thus X — ad X defines a representation of a in v’. As 
p(t] is central, ad X annihilates all the elements of p[t]. Therefore, as a 
is semisimple, there exists, by Weyl’s theorem (cf. Chevalley [3], pp. 70-73) 
a subspace b, such that b,  p[f]=0, b, + p[f]= v’, and b; is invariant under 
all ad X, X ea. Let 


(105) b = b, +a. 


b is a linear manifold, b N p[t]=0, and b+ p[f]=). Our claim that p/[f] 
is a semidirect summand will be proved if we show that b is a subalgebra 
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of h. Since [X,Y] e b, whenever X ea and Y €b, and since a is a sub- 
algebra, we have only to show that [X,Y] eb, whenever X, Y are in by. 
We shall actually prove that [X,Y]=0 for X, Y in b,. Now q maps b, 
onto b, so that the abelian nature of » implies that, for X, Y e b., 
[X,Y] e p[t]. We write 


(106) Ky(X,Y) = p` (X,Y) (X, Ye by). 


K, is evidently a skew symmetric bilinear map of b, x b, into f. If Zea 
and X, Y e b,, [X,[Y,Z]]+[Y,[2,X]]=0, since this element is nothing but 
—[{Z,[X,Y]], and hence 0 as [X,Y] belongs to the center of ). Therefore, 


(107) Ki((Z,Y],X)+K(Y[Z,X]) = 0. 


Now q is an isomorphism of the Lie algebra a onto g, and the vector space 
b, onto v. Therefore there exists a unique skew symmetric bilinear map 
K of v x v into t such that 


(108) K(X’, Y’) = K(q(X"),g(X"))  (X', Y'eb,) 
and 

(109) K((Z,Y],X)+K(Y.[Z,X]) = 0 

for all X, Y in b and Z in g. In other words, 

(110) K(p(Z)Y¥,X)+K(Y,6(2)X) = 0. 


Since (110) is valid for all Z in g, we conclude that K is invariant under the 
representation p. If we choose a basis {A,,---, A,} for t and write 


K(X,Y) = > K(X,Y)A, 
i=1 


then each K, is an invariant skew symmetric bilinear form on v x v. Each 
K, is 0, as p is admissible. This implies that K=0. Therefore, K,=0. 
This proves that b =b, +a is a subalgebra of §. In other words, p[t] is a 
semidirect summand of h. 

We finally come to the second point, namely, to the proof that under 
the conditions mentioned in the theorem G,* admits no one dimensional 
representation other than the trivial one. If it does, g,=»v+g will admit 
a nontrivial one-dimensional representation. Therefore there will exist a 
(complex valued) nonzero linear function A on g, such that A([X, Y])=0 
for all X, Y in g,. Since g is semisimple, every element of g is a linear 
combination of elements of the form [X,Y] with X, Y in g (cf. Chevalley 
[3], p. 67) so that A=0 on g. Let à’ be the restriction of A to v. Then the equa- 
tion A([Z,X ])=0, for X € v and Z eg, shows that the null space of A’ is 
invariant under p. Since p is semisimple, we can select a subspace 
complementary to the null space of à’ and left invariant by p. This space is 
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one-dimensional and so, as g is semisimple, we obtain a nonzero vector 
fixed by p. This is a contradiction. The proof of the theorem is now 
complete. 


This is a basic theorem insofar as relativistic applications are concerned. 
It tells us, for example, that any representation of the (connected) 
inhomogeneous Lorentz group can be obtained from a unitary repre- 
sentation of the covering group. To see this, all we have to do is take V as 
k*, G as the connected component of the group of Lorentz matrices acting 
on V, and x —> p(x) as the action of G in V. It is quite easy to check that p 
is admissible. In this special case, theorem 7.40 was obtained by Wigner 
[1] and, in his work, this theorem was the point of departure for his classifi- 
cation of relativistic wave equations. In the special case when G is the group 
of all invertible linear transformations of V leaving a nondegenerate 
quadratic form on V invariant (dim V >2), theorem 7.40 was obtained by 
Bargmann [1]. 


Compact Groups. Much is known about multipliers for compact groups 
(cf. Moore [1]). We shall content ourselves with the proof of a simple 
result. 


Theorem 7.41. Let G be any compact group satisfying the second axiom 
of countability. Then its multiplier group is a torsion group, t.e., every 
element of M(G) is of finite order. If G is a torus, or a connected and simply 
connected compact Lie group, G admits only exact multipliers. 


Proof. Let m be a multiplier for G. By corollary 7.18 there exists a 
finite dimensional m-representation p(x — p(x)), acting in V say. Let 
¢ — ¢/" denote a Borel nth root of ¢, i.e., a Borel map of T into itself 
which fixes 1 and whose nth power is the coordinate function (—> ¢, n 
being the dimension of V. Then 


p' : x —> (det p(x))"p(z) 


is a projective representation of G; and p’ defines the same homomorphism 
of G into the projective group of V, as p. If m’ is the multiplier of p’, then 
m and m’ are similar. But det p’(x)=1 for all x so that for all x, y in G, 
[m’(x,y)|]"=1. Hence (m)" is exact, proving that any element of M(G) 
has finite order. 

If G is a connected and simply connected compact Lie group, then G is 
semisimple by a theorem of H. Wey] (cf. Helgason [1]), so that G admits 
only exact multipliers. 

Suppose finally that G is a torus, say G=T". Let e :v— e(v) be the 
covering homomorphism from R” onto T”. If m is any multiplier for G, 
m,:v,v' > m(e(v),e(v’))is a multiplier for R”. Since m’ is exact from some 
integer s>1, so is m,’. But the multiplier group of R” is canonically iso- 
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morphic to the R!-multiplier group of R” which is a vector group. Hence 
m, cannot be exact unless m, is itself exact. Therefore, there is a Borel 
function a from R” to T such that m,(v,v') =a(v+v')/a(v)a(v'). Since eis a 
local group isomorphism, we can find a Borel function b on T”, with values 
in T, and an open set N of T" containing the identity, such that 


b 
(111) m(2,y) = roe 


for all x, ye N x N. Equation (111) implies that m is exact (cf. corollary 
7.14). 


Groups Which Are Not Semisimple. If the Lie group G is not semi- 
simple, then it may admit nonexact multipliers. We shall now sketch the 
construction of the multiplier group for the inhomogeneous Galilean 
group. We closely follow Bargmann [1] in this discussion. 

If (x,t) (v=, £2, %3) denotes the coordinates of space-time points, 
then an inhomogeneous Galilean transformation is a map 
(112) r:x,t— x,t, 
where 

x = Waet+tot+u, 
(113) 
t = t+, 


W being a rotation in 3-space (det W = +1), u, v vectors in 3-space, and 
ņ a real number. The set of all such transformations forms, under com- 
position, the so-called inhomogeneous Galilean group; we have, for the 
multiplication in this group, 


(114) (W,n,v,u)(W’,n',v',w’) = (WW n+7',Wo' +v, Wu tutn o). 
If we associate with r the matrix 


W v u 
(115) Mir)={0 1 7], 
0 0 1 


then r —> M (r) is a faithful representation. The Galilean group is a semi- 
direct product; the subgroup A of all translations in space-time, 


(116) A = {(1,7,0,u)} 
is normal, the subgroup Go of transformations 


(117) Go = {(W,0,v,0)} 
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is a closed subgroup; A and G, generate the whole inhomogeneous group. 
In fact, 


(118) (1,7,0,u)(W,0,v,0) = (W,n,v,u) 
and 
(119) (W,0,v,0)(1,7,0,u)(W,0,v,0)-* = (1,7,0, Wu +qw), 


which exhibits explicitly the semidirect product nature. We denote the 
inhomogeneous group by G. The subgroup Go, unlike the Lorentz group, is 
not semisimple, and this accounts for some of the more subtle differences 
between the Galilean theory and Lorentzian theory. Let M and H denote 
the subgroups of G, given by 


(120) M = {(1,0,v,0)}, 
(121) H = {(W,0,0,0)}. 


M is a closed normal subgroup of Go, H is a subgroup with G>=MH; 
moreover, 


(122) (W,0,0,0)(1,0,v,0)(W,0,0,0)-2 = (1,0, Wv,0), 


so that G, is once again a semidirect product, with respect to the action 
of H in M given by (122). 

Note also that the space motions of the form (W,0,0,u) constitute the 
subgroup S= UH, where U is the subgroup (of all translations in space) 
{(1,0,0,2)}. U is a normal subgroup of S and 


(123) (W,0,0,0)(1,0,0,«)( W,0,0,0)-+ = (1,0,0, Wx); 


therefore once again we have a semidirect product. 

The first step in determining the multiplier group of G is to examine 
the central extensions at the Lie algebra level. Theorem 7.37 reduces this 
to the question of determining the closed skew symmetric bilinear forms 
on g x g, modulo the exact ones. We shall now do this. As is customary, we 
denote the subalgebras of g by the German letters which correspond to 
the Latin letters that denote the analytic subgroups of G. The relations 
(116) through (123) give considerable insight into the structure of g. 
a is an ideal in g, g=a+Qo, a N 8o=0; m is an ideal in go, gg>=m+h, 
m N h=0; gp and § are subalgebras of g. Also 3=u+ h is a subalgebra, u 
is an ideal in 3, uN §=0. 

Since A, M, and U are abelian and since H acts on all of them via the 
inner automorphisms, H acts naturally on a, m, and u. The differential of 
this action is the usual adjoint action of ý on a, m, u. Thus, using the 
standard notation according to which [p,’] is the linear space spanned 
by all elements of the form [X,Y] with X e p and Yep’, 


(124) (§,m] E m, [hu] S u, 
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and 


(125) [9p] = 0, 


where p is the one-dimensional subalgebra of g which corresponds to the 
subgroup {(1,7,0,0)} of A. 

Moreover, from (122) and (123) we see that the representations of H 
in m and u are equivalent and absolutely irreducible and moreover, leave 
the Euclidean inner product on these invariant; these are therefore 
admissible by lemma 7.39. By a direct calculation or by use of the 
irreducibility of these representations of 9, we find 


(126) [bm] =m,  [h,u} = u. 
Moreover an easy computation shows that 
(127) [pm] cu, [um] = 0. 


Let B be any closed skew symmetric bilinear form on g x g. Then, for 
X, Y ceaand ZE go, we have, as [X, Y]=0, 


(128) B([Z,X],¥)+ B(X,[Z,¥Y]) = 0. 


Equation (128) shows that B is invariant under the representation of Gy 
in a. Since the action of Go on A leaves no nonzero bilinear form on 
A x A invariant, (128) implies at once that B=0 on axa. Note that this 
has been proved for any closed form B. Next, we observe that go=m+ 
satisfies the conditions of theorem 7.40. In fact, the group H is semi- 
simple and its action on M (which is isomorphic to 3-space) as given by 
(122) is, as observed by us already, admissible. Hence the restriction of 
B to 9o X Go is exact. Replacing B by a form which differs from B by an 


exact one, we may thus assume that 
(129) B=0 on axa and gx Qp.- 


We shall study the form B on ax g. Now a+ is a Lie algebra and a 
is an ideal in it. Moreover, the action of H on A, as given by (119), is 
admissible by lemma 7.39, since A decomposes into a direct sum of U 
and the one-dimensional space of time translations. Hence theorem 
7.40 applies once again and we conclude that the restriction of B to 
(a+) x (a+) is exact. Therefore, there exists a linear function q on g, 
vanishing on m, such that B(X,Y)=q([X,Y]) for all X, Y in a+. Since 
§ is semisimple, [§,§]= so that (129) implies that q vanishes also on h. 
If we define B’ by 


(130) BY X,Y) = BXX,Y)—-q([X,¥]) (xX, Yeqgxg), 
then B — B’ is exact and 
(131) B'=0 on axa,axh, and go Qo. 
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If Xep, Yemand Ze ù, the identity 
B'(X[Z,Y])+ BZ. Y,X))+ B(Y,[X,Z]) = 0 


implies, in view of (131) and the relations [Y,X] € u, [X,Z]=0 (cf. (127)), 
that 


(132) B(X[Z,Y]) = 0. 
From (126) and (132) we conclude that 
(133) B'=0 on pxm. 


We are now ready to prove that the set of closed forms, modulo the 
set of exact forms, is one dimensional. To this end, we must analyze the 
restriction of B’ to ux m. Now, for Z e h, we have, as [11,m]=0, 


(134) B'([Z,X], Y)+ B(X[Z,Y]) = 0 


for all X, Yeuxm. Since H induces equivalent representations on u and 
m, the vector space of bilinear forms on u x m has a one-dimensional sub- 
space of bilinear forms invariant under the natural action of H. Let Co 
be a basis for this space. Let C be defined as the unique symmetric bi- 
linear form on gx g such that C=C, on uxm and C=0 on all of axa, 
8o X 8o ax H, and pxm. Then (131), (133), and (134) show that B’ is a 
multiple of C. In other words, the vector space of closed forms modulo 
the subspace of exact forms has dimension <1. On the other hand, it is 
easily verified that C is actually closed. If C were exact, and C(X,Y)= 
q({X,¥]) for all X and Y in g, then the equation [u,m]=0 would imply 
that C=0 on u x m, a contradiction. Thus the space I%p1(g) has dimension 
exactly 1. 

Therefore there exists a nonexact R!-multiplier m,* for the covering 
group G* of G such that every multiplier for G* is similar to one of the 
form 


x*, y* —> exp[irmy*(x*,y*)] 


for a suitable real number r. 

Actually m,* can be chosen to induce a nonexact R!-multiplier m, for 
G itself. Bargmann [1] has indicated a simple method for constructing mo. 
Let us write €=(W,n), C=(v,u). Then we can expand mp as a series 
Èn Pn» Where ¢, is a polynomial in č}, ča, homogeneous of degree n and 
having coefficients which are functions of é and £é. Since the ¢’s are 
transformed linearly in the group multiplication, it follows that each p, 
satisfies the multiplier identities locally. This tells us that a simple and 
promising class of functions, in which we may search for the multipliers, 
would be the class of functions which are analytic in é, &, Či, 2, and which 
are bilinear in (, and ¢, for fixed é, é. If we assume that mọ is of this 
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form and write down the multiplier identities, we obtain equations 
which are quite easy to solve. We find 


(135) mo(r,r’) = <u, Wv — <v, Wu’) +q <v, Wo’, 
where <.,.> denotes the Euclidean inner product on 3-space and 
(136) r = (W,n,v,u), r = (Wn ovu’). 


It is of course necessary to check that m,, when lifted to a Rt-multiplier 
m,* for G*, is not exact. But suppose m* is exact. Then m9*(x*,y*) = 
Mo*(y*,c*) whenever x* and y* commute, and hence m)(x,y)=mo(y,2) 
if x and y commute and are close to the identity. We contradict this by 
choosing x,=(1,0,sv,0) and y,=(1,0,0,su), with s near 0; z7,y,=y,%, but 
Mo(Xs, Ys) — Mo( Ys,%5) = — 2s?{u,v». 


Theorem 7.42. Let G be the inhomogeneous Galilean group. Then for any 
real number T 


(137) M, : r,r’ —> expt 5 [<u, Wv > — <v, Wu Y +n <v, Wo’) ] 


ts a multiplier for G', where 
r = (W,y,v,u) and r = (W’,y’,v’,u’). 


Every multiplier of G is similar to one of this form. The m,, for distinct r, 
are not similar. In particular, m, is nonexact for 7#0. 


NOTES ON CHAPTER VII 


For a closely related treatment of multipliers see K. R. Parthasarathy, 
Multipliers on locally compact groups, Lecture Notes in Mathematics, 
No. 93, Springer-Verlag, Berlin, 1969. For a systematic treatment of the 
cohomology of locally compact second countable groups see the papers of 
C.C.Moore: Trans. Amer. Math. Soc., 113 (1964), pp. 40-63, 113 (1964), 
pp. 64-86; 221 (1976), pp. 1-33; 221 (1976), pp. 35-58; see also his review 
article in Group representations in Mathematics and Physics, Lecture Notes 


in Physics, No. 6, Springer-Verlag, Berlin, 1970. 


CHAPTER VIII 
KINEMATICS AND DYNAMICS 


1. THE ABSTRACT SCHRODINGER EQUATION 


In this chapter we shall use the main results of Chapters V through VII 
to study the dynamical and kinematical properties of quantum mechanical 
systems. This will lead to the Schrödinger equation, the well known 
formulas for the position, momentum, and spin observables, and to the 
standard expressions for the Hamiltonians which enter the quantum 
theory of the atom. 

The first problem we study is that of determining the dynamical 
group of a system, say ©. According to our general principles, we must 
exhibit the dynamical group as a one-parameter group of convex 
automorphisms of the convex set of all states of ©. In this section we 
obtain the form of the most general dynamical group of a system whose 
logic is standard. Our analysis leads to the so-called abstract Schrödinger 
equation of motion. This equation, in its usual form, gives a set of direc- 
tions for computing the possible values of the energy observable and 
describes the development of the state of the system in time. In keeping 
with the spirit of our development, we can say that the Schrödinger 
equation gives the infinitesimal form of the temporal development of an 
arbitrary quantum mechanical system. | 

Let us then consider a system ©; the quantum mechanical character of 
© is put in evidence by the assumption that the logic of © is standard. 
We shall actually assume that the logic of © is that associated with the 
complex number field. We write # for a separable infinite dimensional 
Hilbert space over C whose logic X is isomorphic to that of ©. We 
identify Z with the logic of S. We write S for the convex set of all states 
of F. 

According to our earlier discussion in Chapter III, the dynamical 
group of © is then to be described by a Borel one-parameter group t — D(t) 
of convex automorphisms of S the Borel structure being such as to make 
the maps 

t > (D(t)p)(a) 
Borel, for all pe Y andae L. Now, by Gleason’s theorem (cf. Chapter IV), 
A — pa is a convex isomorphism of the set of von Neumann operators of 
288 
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trace 1 onto S, pa being the state a > tr(P*A) (ae L). Consequently, each 
D(t) induces an automorphism of the convex set of von Neumann operators 
of trace 1, so that we can select, for each t, a symmetry U, of W such that 


(1) Dip, = Pu,au,7} 


(cf. theorem 4.33). As D(t)=D(t/2)?, it follows that each U, is a unitary 
operator. Now, the map t—> U, need not have any measurability properties, 
as the choice of U,, for each t, was made quite arbitrarily. But, if u and v 
are unit vectors of #, and if we write A for the projection on the space 
spanned by u, and L for the space spanned by v, 


Puau (L) = KU u,v) |?. 


Consequently, the map ¢— |(U,u,v)|? is Borel for all u, v e 3. If we 
therefore write 7 for the canonical homomorphism of the unitary group of 
H onto the projective group of #, this implies (corollary 7.2) that the 
map ¢->7(U,) is a Borel homomorphism of R? into the projective group. 
Consequently it is induced by some projective representation of R! in 7 
(cf. Chapter VII). Now, according to theorem 7.38, all multipliers of R? 
are exact. Hence there exists a (unitary) representation V 


of R! in #, such that for each ¢, and each von Neumann operator A of 
trace 1, D(t) is given by 
(3) D(t) : Pa —> Py,ay,-}- 


The representation V is determined up to multiplication by a character 
of the real line, i.e., if V’(t > V;,') is another representation satisfying (3), 
then there is a real number c such that 


(4) Vy = ey, 


for all t. 

To the one-parameter group V we can apply Stone’s theorem (cf. 
Riesz-Nagy [1]). According to this theorem, there exists a unique self- 
adjoint operator H such that 


(5) V, = exp(—itH) 

for all t; the domain 2; of H is given by 

(6) Dy = fe : lim ` (Vw—v) exists}, 
t-—0 

and, for v € Zy, 


(7) Hv = i lim 4 (Vø —v). 
t+0 t 
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We can thus say, in view of (5) and (3), that H induces the dynamical 
group t—> D(t). Since the representation V is determined only up to 
multiplication by a character of the real line, D determines H up to an 
additive constant, i.e., H and H+c-1 (c real) induce the same dynamical 
group. Collecting these facts, we have the following theorem: 


Theorem 8.1. Given any self-adjoint operator H on the complex separable 
Hilbert space # of infinite dimension, there exists a unique dynamical 
group D(t— D(t)) of S such that, for each t, D(t) transforms the state 
corresponding to the von Neumann operator A into the state corresponding 
to the von Neumann operator exp(—1tH)A exp(wtH). H and H+c-1 (ca real 
number) induce the same dynamical group. Conversely, any dynamical group 
D(t —> D(t)) of convex automorphisms of S is induced in the above fashion 
by a self-adjoint operator H, and D determines H up to an additive constant. 
Finally, for any t, D(t) transforms the pure state determined by the unit 
vector u into the pure state determined by the unit vector exp( —itH yu. 


Theorem 11.1 is the central result of elementary quantum theory. It 
tells us that the dynamical groups of physical systems correspond one-one 
to one-parameter unitary groups and even to self-adjoint operators, 
provided we ignore additive constants in the self-adjoint operators. Now, 
according to our basic principles, the observables of the system are in 
one-one correspondence with the self-adjoint operators of #. Conse- 
quently, we reach the remarkable conclusion that to every dynamical 
group there can be associated, in a canonical fashion, a class of observables, 
any two of which differ by an additive constant, and that any such class 
will generate a dynamical group. This conclusion brings us to a very 
interesting formal analogy with classical mechanics. In its Hamiltonian 
form (Chapter I, Volume I), the dynamical group of a classical system is 
determined also by a function H on the phase space, i.e., a classical 
observable, and, since H enters the differential equations in terms of its 
first-order derivatives, this observable in turn is determined by the 
dynamical group only up to an additive constant. 

Prompted by this analogy, we shall call any self-adjoint operator H, 
which induces the dynamical group of a system, a Hamiltonian of the 
system. Given a physical system, the observable which corresponds to a 
Hamiltonian of the system is undoubtedly a very crucial one, since it 
describes the dynamics of the system completely. In physics, this observ- 
able is usually identified with the energy of the system. We shall give, 
further on, some of the reasons that make this identification a very 
reasonable one. 

Assuming this identification, we can now recover the classical prescrip- 
tion of Schrédinger for the determination of the energy levels of the system. 
If u is a unit vector of # which describes a pure state of the system under 
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consideration, and if P(E —> Pp) is the projection valued measure giving 
the spectral decomposition of the operator H, then the probability 
distribution of energy, when the system is in the state corresponding to u, 
is the measure 


(8) E —> ||Pyul?s 


if H has a purely discrete simple spectrum, say {A;, A,,---}, and if 
Uz, Ug, are orthonormal eigenvectors of H corresponding to the 
eigenvalues Aj, Ag,---, respectively, then A,, A,,--- are the possible 
values of the energy observable, say Hn, and, in the given state, 


(9) Pr(En = Xp) = [Kumu]. 


Let us now consider the dynamical group induced by the unitary group 
t > exp(— üH). It is well known, and easy to prove in the context of 
Hilbert spaces (cf. Gel’fand [1]), that there are vectors u in # such that 
t —> exp(—?tH)u is an analytic function of t with values in #; and that, 
moreover, such vectors form a linear manifold which is dense in #4 and 
on which H is essentially self-adjoint. If we take such a vector u and write 
u,=exp(—itH)u, then we obtain the differential equation 


(10) — = —tHu, Uy = U. 


The group t —> exp(—7tH) can be clearly recovered from (10). The equation 
(10) is thus the differential equation which describes the motion of the pure 
states of our system. We shall call it the abstract Schrödinger equation 
(cf. Dirac [1], pp. 108-111). 

From the representation t— V, we can now determine how the 
probability distribution of any observable changes with time. Let & be 
any observable and A the self-adjoint operator associated with it. Let 
E —> P, be the spectral measure of A. If the unit vector uo represents the 
state at time t=0, then the distribution of é at time t is the measure q; 
where 


(11) q(H#) = <exp(ttH)P; exp(—ttH)uo,uo>. 


It is of course quite possible that the distribution of £ at time t=0 remains 
unchanged for all time, no matter what the initial state is. From (11) we 
infer that the necessary and sufficient condition for this is that 


Vi PaV, = Pe 
for all t and all Borel sets Æ, i.e., 


(12) exp(—iiH) exp(—1tsA) = exp(—1sA) exp(—itH) 
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for all s and t. The equation (12) yields, on differentiation, 
(13) [H,4] = 0, 


where (13) is interpreted in the strong sense, i.e., the spectral projections 
of H and A commute. In other words, the observable £ must be simul- 
taneously observable with energy. In analogy with classical mechanics we 
shall call an integral of the motion. The reader may notice that (13) is also 
the condition for é to be an integral of the classical motion determined 
by a classical Hamiltonian H, provided we interpret the bracket in (13) 
as the Poisson bracket. Later on, we shall come across further instances of 
this analogy. In particular, energy is always conserved during motion. 


We shall conclude this section with a few remarks on the “Heisenberg 
picture.” Consider a system whose dynamical development is described 
by the one-parameter group t —> V,. The pure states of the system then 
move in time along the parametrized curves t —> V,u. We shall call these 
curves the motions of the system or, alternately, the dynamical states of 
the system. Clearly any such motion t > ù, is completely determined by 
the vector u,. Hence the rays of the Hilbert space of the system can also 
be regarded as representing the dynamical states of the system. If u and 
v are two unit vectors which represent the dynamical states € and y, 
respectively, the number |<u,v>|? can evidently be interpreted as the 
probability of finding the system in the dynamical state 7 when a measure- 
ment is made on it to find out whether it is in the dynamical state £ or not. 
Thus |<u,v>|?, which depends only on the rays determined by u and v, 
has an invariant physical significance. 

It is possible to reformulate the general prescriptions of quantum 
mechanics in terms of the concept of the dynamical state. We shall indicate 
how this is done by. computing the probability distributions of the 
observables as time changes. Suppose x is an observable and A,=A its 
operator. Suppose that the system is in the dynamical state uo. Then, the 
probability distribution of x at time ¢ is the measure 


qi : E —> <P eV tto, Vito); 


where E —> P, is the spectral measure of A. If we write 4,= V, tAV, 
then Pt : E > V,- +P:V, is the spectral measure of A, and 


qE) = <P itot), 


which describes q, entirely in terms of uo. We see, therefore, that the 
probability distribution of x at time t can be computed if we replace A 
by A, but evaluate the relevant inner products with ug instead of u,. In 
other words, we may regard the change in time as inducing a change in the 
operators which represent x; the dynamical state remains unchanged. 


KINEMATICS AND DYNAMICS 293 


It must be realized, however, that this is only an alternative language 
to describe the same state of affairs. In physical literature, this way of 
looking at things is called the Heisenberg picture and the dynamical states 
are called the Heisenberg states. Our original way of looking at things is 
referred to as the Schrödinger picture. 

If A is the operator which represents the observable x, then at time t, 
it will be represented by the operator A,=V,~'AV, in the Heisenberg 
picture. The (formal) time derivative of A, at t=0 then becomes 7[H,A], 
where we write V,=exp(—wH): 


(14) A = i{H,A). 


We thus obtain the celebrated principle of Heisenberg according to which 
the time derivative of an observable is 1 times the commutator of its 
operator (at time 0) with the energy. Appropriately interpreted, (14) 
retains its meaning in classical mechanics also. 


2. COVARIANCE AND COMMUTATION RULES 


We shall now consider a system such as a particle. A characteristic 
property of such systems is that one can associate with each of them, a 
certain C” manifold M, which is usually called its configuration space. If 
we are interested in a classical interpretation of such a system, M would 
actually be its configuration space. Suppose that © is such a system and 
that we are interested in a quantum mechanical description of ©. Suppose 
we still like to treat the configuration of the system as a physical 
observable. The mathematical way to put this in evidence is to assign, 
for each Borel set E&M, a closed linear manifold S(H) of the Hilbert 
space # of GS; S(E) will then represent the experimental statement that 
the configuration of © belongs to Æ. It is, moreover, natural to assume that 
the assignment E —> S(E) is a c-homomorphism of the Borel structure of 
M onto a Boolean sub-o-algebra of the logic # of 4 i.e., that the mapping 
E — PS = P,, which assigns to E the projection on S(£), is a projection 
valued measure based on M. 

Let us now suppose that we have in addition a Lie group G acting 
differentiably on M and that we want a description of our system which 
is covariant with respect to G (for example, we may treat a particle 
in a spherically symmetric potential field in space; M will then be R’ and 
G will be the rotation group). We then associate, with each element g of 
G, an automorphism a, of the logic Z of our system such that a(g —> a,) 
is a representation of G in Aut(#); moreover, in view of the simple 
meaning of the elements S(H) of Z, we shall require that œ and S be 
related as follows: 


(15) a{S(H)] = S(g[H)). 
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We take the point of view that (15) summarizes the physical assump- 
tions we have made. If G is a connected Lie group, we can select a 
projective representation U(g-—> U,) of G in # which induces a. Then 
(15) can be written as 


(16) U,P Ug) = Pop) 


If U is a representation, then (16) expresses the fact that (U,P) is a 
system of imprimitivity. If U were not a representation but its multiplier 
is exact, then we may change U to a representation so that it still induces 
a. In such a case, we may therefore say that the existence of a con- 
figuration space plus the requirement of covariance leads to a system of 
imprimitivity, the representation and projection valued measure of which 
have direct physical significance. The exactness of the multipliers of the 
projective representations which induce « cannot of course be always 
taken for granted. But in many cases this is the case (cf. theorem 7.40). 

The equations (16) can be written formally in their infinitesimal forms. 
Let U be a fixed representation satisfying (16). Let g be the Lie algebra of 
G. For any X e g and each t, x —> exp(tX )-z is a diffeomorphism of M ; and 
as t varies, we obtain a one-parameter group. Let rx be the vector field 
of this one-parameter group. For any feC”(M) and xe M, we have 
(by definition), 


(17) (raf ia) = $ (flexptX)-2)).-0. 


For any real C” function f on M, let A, be the operator 


A 


(18) A, = J, fdP. 


A, is possibly unbounded but self-adjoint. Then, (16) leads to 
(19) U,A,U,~* = Ago. 


Now, t > Ucexptxy iS a representation of #1, and hence there exists a self- 
adjoint operator Bx, such that exp(—uByx)=U expt% for all t. By 
is determined uniquely and we have, formally from (19), 


[A,,A,-] = 0, 

[A,, Bx] = iA azm 

[Bx By] = tBix,yyp 
aBy+bBy = Baxsoy- 


We shall call the equations (20) the Hetsenberg commutation rules (cf. 
(10), Chapter I). 
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The observables associated with the operators A, and Bx which enter 
the relations (20) may be given interesting physical interpretations. We 
consider first the operators A,. We recall that for any set E, S(E) repre- 
sents the experimental statement that the configuration of S belongs to E. 
Now, from the formula (18) for A;, we see that the operator A, has the 
spectral measure F —> P,-1,, (F a Borel set of the line). Hence, if we 
denote by ¢ the observable corresponding to A,, the statement that the 
value of ¢ lies in F must be represented by the subspace S(f~1(F)). It is 
thus identical with the statement that the configuration of G belongs to 
the set f~1(F). In other words, if we observe, instead of the configuration 
x of ©, the value of f at x, we obtain an observable whose associated self- 
adjoint operator is A,. We shall therefore call these observables con- 
figuration observables. Note that the function f represents classically an 
observable with the same significance. Thus f— A, is an algebra homo- 
morphism which assigns to each classical configuration observable the 
“corresponding” quantum observable. For example, if M = R" and f is 
the kth coordinate function (f,,---,t,)—>4,, then A, is the operator 
which represents the kth configuration coordinate. 

On the other hand, the vector field ry gives rise to the classical mo- 
mentum observable corresponding to the symmetries t —> exp(tX) acting 
on M, and the analogies we have so far developed with classical mechanics 
lead us to define By, as the quantum mechanical momentum observable 
corresponding to the symmetries generated by X. In other words, the 
covariance expressed by (20) can be formally regarded as describing 
certain commutation rules between the configuration observables and the 
momentum observables of the system. We have thus arrived at the real 
mathematical meaning of the celebrated commutation rules of physics; 
at the same time our extensive knowledge of systems of imprimitivity 
enables us to understand why they are so far-reaching. 


3. THE SCHRODINGER REPRESENTATION 


Given M and G, the main problem of course is to determine the pairs 
(a,S) which satisfy the relations (15). This is, at least in the special case 
when a is induced by a representation of G, nothing but the problem of 
determining the systems of imprimitivity for G which are based on M. 
The analysis of Chapter VI shows that such a determination is com- 
pletely known when G acts transitively on M. Fortunately, in many 
physical applications, G is transitive on M, so that these results can be 
applied. On the other hand, whether or not G acts transitively on M, 
there always exist the Koopman systems of imprimitivity associated with 
the invariant measure classes on M. In this section we shall explicitly 
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describe these latter systems in the form in which they are most familiar 
in physics. 

We start with the C” manifold M and a Borel measure v on it. We write 
H = L*(M,v). For any real C” function f on M we define A, to be the 
operator of multiplication by f: ) 


(21) A,:p—>fo. 
For any C” vector field Z on M we define Bz as the operator 
(22) Bz: g > —iZp. 


Both (21) and (22) are meaningful only when 9 is suitably restricted, 
certainly when o e C,° (M). Now, if we define A, rigorously by (21) on the 
set of all » such that pọ and fp are both in % then it is well known that A, 
is self-adjoint and is the closure of its restriction to C,” (M). On the other 
hand, it is not always possible to make B, self-adjoint even if it is formally 
symmetric on C,°(M); the well known example is of course the one 
where M =]0,œ[, and Z=d/dx (v is Lebesgue measure). We shall now 
formulate the following useful lemma which describes a set of sufficient 
conditions for B; to be essentially self-adjoint when defined by (22) on 
C.” (M). 


Lemma 8.2 (van Hove [1]). Let t->g, be a one-parameter group of diffeo- 
morphisms of M such that t, x—>t:x = g(x) is C© from Rix M into M. Let 
Z be the vector field induced by t->g,. Suppose that v is invariant under all 
the gi. Then, the operator Bz is symmetric on the domain C{(M), and tts 
closure is self-adjoint in H. If we denote this closure by Bz also, then for 
any t, exp(—tBz) is the operator f>f% in H, i.e., 


(23) (exp(—w#Bz)f)(x) = f(g.~*2). 


Proof. By definition, (Zq)(x)=((d/dt)p(t-x));-9 for all eM and 
po EC,°(M). We begin our proof with the observation that a complex 
valued Borel measure v’ is invariant with respect to the g; if and only if 


(24) Í (Zp)dv' = 0 


for all p e C,” (M). In fact, let u(s)= | „ p(s-)dv'(x). If v’ is invariant, u(s) 
is independent of s and hence has vanishing derivative, proving (24). 
On the other hand, if u is as above, we have, 


T = |, (Zeve-zhde(a), 


= Í (Z(@~*))(«)dv' (a), 
M 
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so that (24) implies that u is a constant, giving the invariance of v’. This 
said, we come to the proof proper. For 9, 4 € C.” (M), 


Í plt-x)p(x)dv(x) = { playp(t--x)dv(e), 
M M 


since v is invariant. Therefore, we have, on differentiation, 


f (Zp)ļġdy = — f p(Zyp)dv, 


and it follows that the operator Bz : p —> —iZọ is symmetric on C,” (M). 
We shall now prove that the closure of Bz in # is self-adjoint. It is known 
from standard spectral theory that a necessary and sufficient condition 
for this is that for any nonreal complex number ¢, the operator B,+¢-1 
map C,” (M) onto a dense linear manifold of # Suppose that fo E H 
and 


—i Í (Zp)fodv+% f pfodv = 0 


for all p in C,” (M). We must prove that f,=0. Let p=Z/t. Then p is not 
purely imaginary and 


(25) f (Zoo = p | ofod 


for all p e C,” (M). 


Consider now the manifold M’=R'x M and the one-parameter group 
t —> g; of diffeomorphisms of M’, where 
gi (8,0) = (¢+8,t-2). 


Write Z’ for the corresponding vector field ; for u e C,*(R1) and peC,~(M), 
let u Q ¢ be the function s, x — u(s)p(x). Then, 


f| Zu @ ontemrerrfotnrdrtayds = |" FE eeds [ gofalai) 


, 


+ [mers | (Zp)(x)fola)dv(a) 


— © 


= 0, 


as is easily seen by integrating by parts the integral with respect to ds in 
the first term and using (25). A standard approximation argument (cf. 
Schwartz [1], pp. 108-109) now leads to 


f f (Z'w)(s,x)e°sfo(x)dsdv(x) = 0 
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for all we C,”(M’). In view of our observation at the beginning of the 
proof (cf. (24)) we conclude that the complex Borel measure À, where 


NE) = ff erfolerdsdr(a), 


is invariant with respect to the group t —> g,’. But dsdv is also an invariant 
measure. Hence for each t, we must have e+ f,(t- x) =e°*f,(x) for almost 
all s, x, i.e., 


folt-2) = e-f) 


for v-almost all x. Therefore, for each real t, 


f oae) = (f [folal2avtey) leat 


and since p is not purely imaginary, we conclude that f, =0. As mentioned 
before, this proves that the closure of the operator p > —iZp is self- 
adjoint. 

Now the operators V,, where 


(Vif) = ftx) (fe #), 


are unitary and V,=exp(—7tA) for a self-adjoint operator A. Further, 
Agp= Bzp whenever p € C,” (M). Hence A must coincide with the closure 
of Bz. This finishes the proof of the lemma. 


Consider now a connected Lie group G acting on M such that the map 
g, x —g-:x of Gx M into M is C”. Let v be a Borel measure invariant 
under G. We then take # = #?(M,v) and define 


Prp = xe‘, 
(U,p)(xz) = o(g- *2). 


(U,P) is a Koopman system of imprimitivity associated with v and G. 
If we take # as the Hilbert space of a system with M as its configuration 
space and G as its group of symmetries, then we see that the configuration 
observables are represented by the multiplication operators A, (feC*(M)). 
For any element X € g, the Lie algebra of G, the momentum observable 
By, corresponding to the symmetries x — exp(tX)-x of M is the differential 
operator —irx, where ry is the vector field on M defined by 


(26) 


(raphe) = (GARDD) 


t=0 
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the differential operator By = —irx is the self-adjoint operator which is the 
closure of the operator p —> —irxp on C,°(M) and Ue ix) =exp(—wtBx) 
for allt. The By and A, satisfy the commutation rules: 


[AA] = 0, 

[Bx4;] = tAgxpy 

[Bx, By] = 1Bix,y); 
aBy+bBy = Baxsoy» 


(27) 


where f, f’ € C”(M), a,b are constants, and X, Y eg; the equations (27) 
are valid on the dense linear manifold C,” (M). 

This particular representation of the equations (20) is known as the 
Schrödinger representation. From our point of view the relations (16) are 
the fundamental ones; the commutation rules (20) are to be regarded 
merely as their formal infinitesimal versions. The reason for the frequent 
appearance of the commutation rules in physical literature is simply due 
to the tradition in mechanics and physics according to which the laws 
governing physical systems were always expressed in the infinitesimal 
form whenever possible. Rigorous formulations of (20) can be given; 
but they get involved with the domain considerations inevitable in any 
considerations of unbounded operators, and the equations lose the global 
geometric flavor possessed by the “integrated” relations (16). The 
mathematical importance of the integrated versions of (20) seems to 
have been first recognized by Wey] ([1], pp. 272-276). 

When G acts transitively and freely on M, we may use theorem 9.17 
to conclude that every irreducible system of imprimitivity for G based on 
M is equivalent to the Koopman system. We may state this loosely by 
saying that every irreducible representation of the commutation rules 
(20) is equivalent to the Schrödinger representation. When M was R” and 
G was the group of its translations, this was first proved by Stone [3] and 
von Neumann [3]. Our method of proof, based on theorem 6.17, is that of 
Mackey [9]. This result means that the apparently special choices made in 
the Schrédinger representation, for representing the states and the 
position and momentum observables, are in fact the most general ones 
possible subject to the commutation rules (20), provided we assume 
irreducibility. We may thus regard this theorem as proving the iso- 
morphism of Schrédinger’s wave mechanics (based on the Schrödinger 
representation), and Heisenberg’s matrix mechanics (based on the 
commutation rules (20) only and not on any specific choice of the A, and 
By), and thereby unifying the two great developments. 

The irreducibility of the (U,P) can be given an interesting interpreta- 
tion. The system (U,P) is irreducible if and only if the elements in the 
commuting ring of (U,P) are the multiples of the identity or if and only if 
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the smallest von Neumann algebra containing the spectral projections of 
the By and A, is the ring of all operators in #. Let us define a self-adjoint 
operator A to be a function of the By and A, if A belongs to the von 
Neumann algebra generated by the Bx and the A,. Then every self- 
adjoint operator is a function of the By and the A,. On the other hand, 
if we consider the classical system with configuration space M, it is true 
that all the observables are functions of the configuration observables and 
the linear momenta. We thus see that the assumption of reproduci- 
bility corresponds (roughly) to the assumption that there are essentially no 
new observables other than the configuration ones and the linear momenta. 


4. AFFINE CONFIGURATION SPACES 


An affine space is a pair (M,V), where (i) V is a real finite dimensional 
vector space, and (ii) V acts freely and transitively on M. By the usual 
abuse of language we shall refer to M itself as an affine space. V is said to 
be the vector space associated with M. For ve V we write t, for the 
transformation of M induced by v and refer to it as the translation by v. 
We write 

ty x = X+0. 


Given x and y in M, we shall write y — {x for the unique element v of V such 
that t,-2=y. Since ¢_, and t, are inverse to each other, y—x= —(x—y), 
while the relation t,,, ,,-=t,t,- leads to the equation (x— y)+(y—z)=2x—z. 

A one-one map L of M onto itself is called an affine automorphism of M 
if there exists a linear automorphism L° of the vector space V such that 
for all x and y in M, 


(28) La— Ly = L(x—y). 


Since for fixed x, y —> (x—y) maps M onto J, it is clear that L° is uniquely 
determined by L. If L? is the identity, we shall call L a translation. If 
ve JV, it is trivial to check that t, is a translation. Conversely, let L be a 
translation and let x and y be two points such that y= Læ. If we write 
y=x+v, then it follows from the equation Lz, — Lz,=2, —2, that Lis the 
translation x —> x +v. If we choose a point xo € M and map M bijectively 
onto V by the map v > x +v, then the affine automorphisms of M are 
easily seen to be precisely those which go over to the inhomogeneous 
transformations 


(29) a —> L°a+b 


of V. Thus, in the notion of the affine automorphisms of M, we have an 
invariant description of the inhomogeneous linear transformations which 
are associated with vector spaces. 
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If we choose a basis v,,---, v, for V, and a point 2) E€ M, then the map 
(30) (ti, a) t,) —> To + (> t) 
j=1 


is one-one from R” onto M, and its inverse maps M onto R” bijectively. 
We shall call the inverse of (30) an affine coordinate system on M with 
origin in £o. Suppose Xp, zı E M, {v,,---, va}, {Wy,--+, Wa} are two bases 
for V, and y, y’ the affine coordinate systems of M associated with zo, 
{v,,°°', Un} and 2%, {w,,°°°,w,}. Then there exists (da,,°::,@,)E€R" and 
an invertible matrix (d,;)",., such that, for any xeM, the images 
(teesta) = y(x) and (t,t, ta) =y (x) are related by 


(31) t = > dit +t. 
j=1 


We thus obtain the usual formulas which connect the two coordinate 
systems. From the formula (31) it follows that there is a unique C® 
structure on M such that each affine coordinate system is a diffeomorph- 
ism of M onto R”. With this C” structure, the map v, x > x+v is 0°” 
from V x M into M. 

It is obvious that M admits an invariant Borel measure, say dm. 
dm goes over to a multiple of the Lebesgue measure in R” under any affine 
coordinate system. From our analysis in Section 2 we conclude that the 
only irreducibly covariant systems with M as their configuration space are 
described by the Schrödinger representation. The Hilbert space of such 
systems can be taken to be ¥2(dm). The configuration observables are then 
represented by the multiplication operators. If ve V, the momentum 
observable corresponding to the symmetries x —> x +tv is represented by 
the operator B,: 

(32) B, = —10(»), 

where O(v) is the usual linear differential operator associated with v. 
We shall call this the linear momentum observable in the direction of the 
vector v. 

If we choose now a point x, € M, a basis {v,,---, v,} of V, and map M 
onto R” with the help of the corresponding affine coordinate system y, 
we can obtain concrete forms for these results. For simplicity of descrip- 
tion we identify M with R” and # with @*(R"). Then the kth coordinate 
function, say q,, is represented by the operator of multiplication by tẹ: 


(33) Ik ~ p(t, ney tn) —> tpt, mr, tn), 
and the linear momentum in the direction of the jth coordinate axis, say 
P; is represented by the operator —1(0/d¢,): 


oO 
(34) Pi ~ plti ees bn) > ae ltrs: + +5 n). 
J 
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The second of the commutation rules (20) then becomes 


(35) [GPs] = (—1)76,,1, 


which is the form in which the rules were stated first by Heisenberg. 
If ọ is a unit vector in % i.e., 


(36) f olePam(a) = 1, 


then the probability distribution q, of the configuration of the system in 
the (pure) state represented by o is given by 


(37) qo(E) = i p(x) [2d m (2). 


Since we are identifying M with R” using a coordinate system, the 
formula (37) becomes 


(38) qe(E) = f, lolt: tg t,)|?dt, - i dt,- 


The formulas (34) and (35) were first postulated by Schrödinger when M 
was three dimensional and the system to be analyzed was that of a single 
particle. Formula (34) gave the forms of the position operators, (35) the 
forms of the components of the linear momenta. Formula (38) gave the 
probability that in the state represented by 9, called the wave function 
of the particle, the particle is to be found in the region E and was first 
recognized by Max Born. From our point of view, we see that the whole 
development falls out of the assumption of affine covariance and the 
irreducibility of the system (U,P). 

The special forms of the configuration and momentum observables can 
also be used to illustrate the phenomenon of complementarity so peculiar 
to quantum theory. Suppose # U, P are as above. Let us choose an 
affine coordinate system for M and identify M with R” and # with 
£*(R") via this coordinate system. The expressions for the configuration 
observables show that they commute among themselves. On the other 
hand, the translation operators U, also commute among themselves, so 
that the momentum operators B, commute among themselves. Let us 
now take a function pe #?(R") of unit norm and compute the probability 
distributions of the vector observables q=(q,,---, qa) and p=(j,--+; Dn): 
For q, this is straightforward; it is 


(39) E >f plti t’ tn)| dt; : _ dtn 
E 
To compute the distribution of p we must use Fourier transforms. Let 


(40) F :p—ý 
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be the Fourier-Plancherel transform of #(R") onto itself which is the 
unique unitary automorphism of £?(R") with the property that for all 
yeC,(R”), 


A 1 . 
(41) P(t, mers tn) = (2m)"2 f exp — [tx + mee +£,2n] 


x p(z: a) L,)dx, - _ dx, 
for all (t,,---,¢,)¢R* (Bochner-Chandrasekharan [1]). It is then easy to 
show that for v=(v,---,v,), FU, Ft is the operator of multiplication 


by exp [—12(v,f, +--+ +4,¢,)]. From this it follows easily that the spectral 
measure of B, is given by F > F~'P,,F, where 


(42) Fy = (by es ty) i Viti +--+ +0, t, © PS 


(F a Borel set on the line). From (42) we see that the probability distri- 
bution of p in the state given by 9 is 


(43) B> [1a s i) Pdia + ay 
E 


Formulas (39) and (43) show the remarkable statistical connection between 
the configuration observables and momentum observables. If p is such 
that œ vanishes outside of a small region S, the configuration of the system 
is localized in S statistically; formula (43) then tells us that the distribution 
of the momenta is very diffuse. Conversely, states » for which ¢ has a 
small region for its support localize the momenta; but the configuration 
observables will have diffuse probability distributions. 

No single state exists in which both the position and the momenta can 
be localized sharply and the same system exhibits both types of properties 


in appropriate states. 


5. EUCLIDEAN SYSTEMS: SPIN 


We shall now discuss the situation when G is transitive on M but its 
action is no longer free. The different systems with M as their configuration 
space and G as their symmetry group will now be labeled by representa- 
tions of the stability groups. In the special case when M is a three- 
dimensional Euclidean affine space and G is the covering group of its 
group of motions, this leads to the kinematical classification of such 
systems in terms of their spin. 

A Euclidean affine space is a triple (M,V,<.,.>), where M is an affine 
space with associated vector space V, and <.,.> is a positive definite 
inner product on V x V. By the usual abuse of language we shall refer to 
M itself as a Euclidean space. An affine automorphism L of M is called 
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a motion of M if the associated linear transformation L? of V is orthogonal 
with respect to ¢.,.> and has determinant +1. The set of all motions of 
M is a group, denoted by G. For each v e V, the translation t, : x —> 2+ 
belongs to G and v — t, is an isomorphism of V onto a normal subgroup 
To of G. For any x e M we write G, for the stability subgroup of G at x. 
Given 2 E€ M and a basis {v,,---, va} (~=dim V) of V, the affine coordinate 
system relative to these is called a Huclidean coordinate system, if 
{vis ++, Yny} is an orthonormal basis for V. It is clear that if we map M 
onto R” using a Euclidean coordinate system, G goes over to the inhomo- 

geneous SO(n)-group in R” and G,, goes over to the rotation group 

SO(n), where x, is the origin of the coordinate system. From this observa- 
tion we can obtain at once the structure of the group G. For any xe M, 


(44) G = TG, To A Gh, = {identity}. 


Moreover, there is a canonical way to convert G into a Lie group. To and 
G, are closed, and G, is compact. If n>2, G is connected. If n>3, 
G, is semisimple and its universal covering group is also compact. We 
shall choose a universal covering group G* over G and denote by è the 
covering homomorphism of G* onto G. 6 is an isomorphism of 7'5*, the 
component of the identity of 6~+(7'9), onto To; on the other hand, for 
any xe M, 6-1(G,)=G,* is connected, simply connected, and covers Gy 
(through ò). G* is a semidirect product of T,* and G,*. 

Suppose we consider a quantum mechanical system © for which a 
Euclidean affine space M is the configuration space. Assume the dimension 
of M to be >3. For a description of GS which is covariant with respect to 
the group G of motions of M, we must introduce a pair (U,P), where 
P(E —> Pp) is a projection valued measure based on M and U a projective 
representation of G such that for all g and £, 


(45) U,P,U,~* = Paier 


Both U and P act in #, the Hilbert space of &. Now the group G satisfies 
the conditions of theorem 7.40 and hence we know that the universal 
covering group G* of G has only exact multipliers. We may thus assume 
that the projective representation U arises from a unique unitary rep- 
resentation V of G* in #. V and P will then satisfy the relation 


(46) VoPEV g~? = Page (g* € G*). 


The action g*, x —> 5(g*)-x converts M into a transitive G*-space and (46) 
tells us that (V,P) is a system of imprimitivity for G*. (V,P) is irreducible 
if and only if the pair (U,P) is so. Conversely, suppose (V,P) is an irreduc- 
ible system of imprimitivity for G* based on M, acting in # Then the 
equation (46) shows that for any element g* in the kernel of ô, V, 
commutes with all P,; but V, also commutes with all the V, (x* e G*), 
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since g* lies in the center of G*. Hence the irreducibility of (V,P) will 
imply that V maps the elements of the kernel of ô into scalars. From 
this we at once infer the existence of a projective representation U of G 
such that x* > U xz» and x* —> V, will define the same homomorphism 
of G* in the projective group of #. 

Our problem is thus reduced to the determination of the irreducible 
systems of imprimitivity for G*, based on M. Unlike the situation 
involving only the translations, there exist now irreducible systems 
(V,P) which are not unitarily equivalent. From the general theory 
developed in Chapter VI we know that the unitary equivalence classes of 
our pairs (V,P) are in canonical one-one correspondence with the equiva- 
lence classes of irreducible representations of the stability subgroup G," 
at some point 2,¢M. However, the relation of unitary equivalence 
between the systems (V,P) is in general finer than the relation of physical 
equivalence; we shall call two systems of imprimitivity for G* based on M, 
say (V,P) and (V’,P’), acting in # and #”, respectively, physically 
equivalent if there exists a unitary or antiunitary isomorphism W of # 
onto #” such that (i) P= Wo P'o W~?, and (ii) the representations V’ 
and Wo Vo W~—! of G* in #” give rise to the same homomorphism into 
the projective group of #’. Notice that as G* has no one-dimensional 
representations (cf. theorem 7.40), (ii) is equivalent to requiring that 
V'=W o V o W-1, If W is unitary we are back to unitary equivalence. 
But if W is antiunitary, the relation of physical equivalence will in general 
not imply unitary equivalence. We have, in fact, the following elementary 
result which clarifies the connection between unitary and physical 
equivalence. 


Lemma 8.3. Let X be a transitive Borel H-space for a lcsc group H and 
let xo E X be an arbitrary point. Let H, be the stability subgroup at xo. 
Suppose that m, (j=1, 2) are two unitary representations of Hy. Let (V7, P?) 
be the associated systems of imprimitivity acting in H1, say, j=1, 2. Then, 
wn order that there exist an antiunitary operator W of #| onto Æ? such that 
V2=W o V10W~! and P?=W o Pto W~), it is necessary and sufficient 
that m, and m, be contragredient to each other. 


Proof. We choose a quasi-invariant measure a on X. Let 47 be the 
Hilbert space in which m, acts. Then, according to the general theory of 
Chapter VI, there is a strict (H,X)-cocycle gf, whose values are in the 
unitary group of #7! with (V‘,P/)~(V!,P’), where (V!,P4) is associated 
with gy’ and a and acts in £7(X,47,a). We choose a fixed (but otherwise 
arbitrary) antiunitary automorphism S of X? and write S~ for the map- 
ping of $?(X,%1,a) onto itself defined by 


(47) (S~f)(x) = Sfx) (xex). 
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S~ is evidently an antiunitary automorphism of #7(X,%1,0) and 
S~ P1S~-1= P}, as may be easily calculated from the formula P,'f=y,f. 
We write 


(48) U1 = 8~P1s~-}, 


(U1, P2) is also a system of imprimitivity for H based on X. Moreover, 
using the formula (39) of Chapter VI it can be easily calculated that U1 
is the system associated with the strict cocycle øt, where 


(49) @' (9,2) = Sp (gx). 


Thus there exists an antiunitary isomorphism W of #2(X,%1,a) onto 
L (X, X? a) such that WV! W-1= F? and WP!W-1= P? if and only if 
there exists a unitary isomorphism W’ which sets up an equivalence 
between (U1,P1) and (V?,P2); in fact, W’ and W are related by the 
equation W’=WS~~1. Hence a necessary and sufficient condition that 
there should exist an antiunitary operator W such that W o Vio W-1= p2? 
and Wo P! o W-t= P? is that the cocycles g! and q? define equivalent 
representations at Zp, i.e., the representations m, and § o m, o S7} are 
equivalent. Now there exists a canonical antiunitary isomorphism S, of 
X with its dual which intertwines m, and its contragredient m,°, i.e., 
m,=S,~1m,°S,. Thus our necessary and sufficient condition reduces to 
the equivalence of m, and (SS,~1) o m,° o (SS, ~1)7}, i.e., of m, and m°, 
since SS, 7+ is unitary. This proves the lemma. 


Applying this lemma to the case with which we are concerned (since 
G,,,* is compact, its irreducible representations are all finite dimensional), 
we conclude that the physical equivalence classes of irreducible systems 
(V,P) are in one-one correspondence with physical equivalence classes of 
representations of G, ,*, where two representations of G,,* are defined to be 
physically equivalent if they are either unitarily equivalent, or are contra- 
gredients of each other (it is easy to verify that this relation is an 
equivalence relation). In this sense, therefore, we have arrived at the 
complete solution to our problem. 

All that remains is to obtain the explicit expressions for the systems of 
imprimitivity. We use dm for the measure on M which is invariant under 
the translations. It is obvious that dm is invariant under the motions of 
G,,* so that it is invariant under the action of the whole of G*. To com- 
plete the construction, we must determine, for each irreducible repre- 
sentation s(h —> s(h)) of G,,*, a strict (G@*,M)-cocycle g with values in the 
unitary group of the Hilbert space “ in which s acts, such that p° defines 
the representation s at 2. We shall construct g* in the following way. 
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Since To* is normal in G* and G*=T)*G,,*, we have, for any g* e G*, 
the unique decomposition 


(50) g* = t(g*)r(g*)  (t(g*) e To*, r(g*) € Gr"). 


The equation g,*g.*=t(91*)[7(91*)t(go*)r(gi*)~ *Ir(91*)r(g2*) shows at 
once that 


(51) r(gi*go*) = r(g1*)r(g2*) = (91*, Ga* E Gro”). 
Moreover, for g* in G,,*, 
(52) r(g*) = g*. 


(51) and (52) show that, if we define g: by 
p°(g*,x) = s(r(g*)), 


then ¢° is a strict cocycle of the type we want. 

We are finally in a position to describe the irreducible pairs (V,P) 
explicitly. Let s(h —> s(h)) be an irreducible representation of G,,* in a 
(finite dimensional) Hilbert space % and let 


(53) H = LPM, X dm). 
For Ec M and g* e G* we define 


Pip = xe $, 
(Vape) = s(r(g*))A(8(g*) =- x). 


Then (V%,P%) is the system of imprimitivity corresponding to the 
irreducible representation s of G,,*. 

The most important special case of the preceding discussion arises when 
M is three dimensional. We shall now proceed to consider this case in 
somewhat greater detail. We select a Euclidean coordinate system y 
with origin xz). Using y, we shall identify M and V with R?; the bilinear 
form on V x V becomes the usual one. G will then be identified with the 
group of all rigid motions of R?. G,, goes over to the rotation group 
SO(3). We write Go for G,, and G* for G,,*. 

We shall first construct an explicit covering of Go by G)* following 
the well known construction of Weyl ([1], pp. 143-146). We introduce the 
Hermitian matrices (Pauli spin matrices) 


5s pı [p =i fl 0 
( ) 0, = ( o) 03 = (; s) 03 = k i) 


and the correspondence x —> A(x), where x eR? and 


(54) 


(56) A(x) = L107} + T203 + X303. 
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Let 
(57) G)* = SU(2,C). 


Then for g* e Gy*, A(x) —> (g*)A(x)(g*)~} is a linear transformation of the 
vector space of all 2x2 Hermitian matrices of trace 0. Hence, via the 
correspondence (56), we obtain a linear transformation 6(g*) of R°: 


(58) A(8(g*)x) = g*A(x)g*™°. 


Since det A(x)= — (x,? + £3? + £3?) is preserved under unitary similarities, 
d(g*) is an orthogonal transformation of R?. It is well known that 
6(g* —> 5(g*)) is a covering homomorphism of G)* onto Gp. Since the 
group G consists of elements (g,v) (v e R?, g e G) with the composition 


(59) (9.v)(9'v') = (ggv +g: y’), 

we may define @* to be G" x R? with the composition 
(60) (g*,v)(g* v") = (g*g* v + 5(9*)v’). 
The map 

(61) ô : (g*,v) > (8(9*),v) 


is a covering homomorphism from G* onto G. | 
It is useful to describe the one-parameter subgroups of G)* which map 
onto the familiar rotations around the axes. An easy computation leads 


to the following. If t ~ g,*(t) are the three one-parameter subgroups of 
Go* defined by 


e*2 Q cos ¢/2 isin t/2 
ga*(t) = ( pua)” gi*(t) = ( ) 


0 i sin t/2 cos t/2 
(62) 
cos t/2 sin t/2 
art = (2 a)? 
—sint/2 cos t/2 
then t —> 8(g,*(¢)) =g,(t) is given by 
cos £ sin £ 0 1 0 0 
ga(t) = | —sint cos t 0 |, g,(t) = | 0 cos ¢ sin £ }, 
0 0 l 0 —sint cos t 


[cost 0 —sintż 
galt) = £ 1 0 
sin t 0 cos t 


The theory of the irreducible representations of @,* is well known 
(cf. Weyl [1], pp. 143-180). The equivalence classes of these representa- 
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tions are labeled by a parameter j which takes the values 0, 4, 1, $,- ++. 
For a given j, the associated representation, say D’, is of dimension 
27+1, and there exists an orthonormal basis in the representation space 
of Di, say {v,, : v=j,7—1,---, —Jj}, such that 


(64) D4(93*(t))vjy = ely, 


The basic representation g* —> g* corresponds to j = 4. The representations 
D’ for integral j map the kernel of è (={ + 1}) into the identity, and hence 
induce representations of Go. The other D’ induce only projective repre- 
sentations of Go. 

An explicit realization of the D’ can be obtained as follows. We regard 
G,* as operating in the obvious manner on C?=W and consider the 
tensor product W®%=W @---@ W (N factors, N=2)). Then W®” is a 
Hilbert space of dimension 2”. If we define, for g* € Go*, m(g*) by 


(65) n(g*) = J @---@g*, 


then m(g* —> m(g*)) is a representation of G* in W8”. x leaves the space of 
symmetric tensors invariant. Let W/ be this subspace. Then W’ is irreduc- 
ible under z, and the representation defined by W/ is equivalent to D’. 


If e= (5). the reader can trivially check that 


(66) m(93*(t))(e; @- + O e) = ele @---@ e). 


Finally, as the representations D’ have different dimensions for distinct 
values of j, and as mutually contragredient representations have the same 
dimension, we see that each D’ is contragredient to itself. In view of our 
earlier discussion, the physical equivalence classes of the pairs («,S) 
satisfying (15) depend on the parameter j (j=0, 3, 1,---), and for any 9, 
(a,S) is the pair obtained from the corresponding system of imprimitivity 
induced by the representation D’ of G*. 

The Hilbert space of wave functions of a typical system which is 
irreducibly covariant under the Euclidean group of R? can thus be 
regarded as the space #7 of square integrable Borel functions on R°? 
whose values are symmetric tensors of rank (N =2j) over the two- 
dimensional complex space W. Let W’ be the Hilbert space of symmetric 
tensors of rank 2j over W with norm |-| and inner product ¢.,.> induced 
naturally from W. Let Dé denote the representation of G)* in W’. Then, 
for the elements 4 of #7’, we have 


(67) f yd < o, 
and for any element (g*,v) of G*, we have 
(68) (Voge, w(x) = Di(g*)$(3(9*) x — 8(9*) ~v), 
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while P, is, as usual, the operator 4 —> x,y. The operators of V induce the 
lattice automorphisms corresponding to the elements of G. 

The formulas (67) and (68) are the usual ones in physical literature 
which express the “transformation of wave functions when the frame is 
rotated.” Note that if (g,v) is any Euclidean motion, (68) tells us that we 
must first transform into the function x —> $(g~'x—g7+v), and then 
multiply the result by the matrix D’(g*), where g* lies above g. There is of 
course an ambiguity in the choice of g*; but this only changes D’(g*) into 
+ Di(g*) and the resulting lattice automorphism depends only on g. 
For integral values of j, there is no need to go to G*, as D’ defines already 
a representation of Go itself. 

From (68) we can obtain the expressions for the various momenta 
corresponding to the elements of the Lie algebra of G. If ve R°, and we 
consider the translations x > x +tv of R3, we obtain, for the corresponding 
linear momentum B,, the expression 


(69) B, : 4 —> —10(v)¢, 


where ô(v) operates componentwise. This is exactly the same as in (32). 
On the other hand, we now have in addition the angular momenta, which 
correspond to the various one-parameter subgroups of Go*. Let X,, X2, 
and X, be the elements of the Lie algebra of Go* which correspond to the 
one-parameter groups gı, go, and gz, respectively. If M,, M3, and M, are 
the corresponding momentum operators, we have, from (62), (63), and 
(68), for r=1, 2, 3, 


(70) M yp = i{0, + Di(X,)p}, 


where 


ð ð 
01 = — T3 Ja, 2 ax. 


0 o 
(71) da = ta gy T ga 
9 o 4 ra 
37 Lo 3 Tı Oxy 
and 
(72) DX,) = (5 Di(g,*() 
dt t=0 


It is usual to write M,° for iô, and M, for the operator 4 —> iD#(X,)d. 
M, is called the angular momentum around the x,-axis. The operators M,° 
and M, are actually defined by the relations 

exp(—itM,°) = Ay, 

exp(—iwM,5) = By, 
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where t—> A,’ is the one-parameter group defined by (A,"/)-(x)= 
w(5(g,*(t))~ 1x), and t—> Bë is the one-parameter group defined by 
(B,"b)(x) = D?(g,*(t))b(x). Since A,’ and B} commute, so do M,° and 
M3: 


(73) [M,°,M,3]=0  (r,k = 1,2,3). 


M,° is called the orbital angular momentum around the z,-axis, and M, 
is called the spin angular momentum around the z,-axis. By lemma 8.2, 
M, is essentially self-adjoint on C,°(R°,W); M, is bounded. Moreover, 
M, is also essentially self-adjoint on C,°(R3,W). 

We note that the group )*, as well as its one-parameter subgroups, is 
compact. Hence for any element X of the Lie algebra of Go*, the operator 
Bx (defined by exp(—ttBx) = Veexp ix) Will have a discrete spectrum. From 
the formulas (62) it follows at once that the angular momentum observables 
M, have only the half integral numbers 0, +4, +1, +3,--- as their 
possible values. This was one of the earliest facts recognized in quantum 
mechanics and emphasized by Niels Bohr (cf. Weyl [1], pp. 185-191). 

The formulas (54) show that insofar as the translations of M are con- 
cerned, we have the usual system of imprimitivity for the translation 
group. However, if 70, this restricted system is no longer irreducible. 
Therefore there will be observables which will be simultaneously measur- 
able with respect to the linear momenta and the configuration observables. 
The operators M,°, M,*°, and M,° correspond to such observables as can be 
directly verified from the equation 


M, 5 = iD(X,)yp. 


These are usually called the spin observables. Note that each M, acts 
only on the components of p, and not on the argument x of p. The possible 
values of M, are just 27 +1 in number; these are j, 7 —1, 7 —2,---,—/. 
The parameter j, which enters so basically in the description of the system, 
is called the spin of the system. 

The case 7=4 is of special interest. In this case, the y’s map R3 into 


C?, so that each ẹ can be written as (5). where ý, and p are square 
2 


integrable on Rê. A brief calculation then shows that the operators M, 
are described by 


(74) M: ifs —> — 20,p, 


where oj, a2, and o; are described by (55). The observables M,* have only 
the two possible values + 4. 

With this we have completed what might be called the kinematical 
description of covariant systems with a Euclidean configuration space. 
We have obtained the usual expressions for the familiar observables and 
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shown that the introduction of the spin observables is an inevitable 
consequence of the geometry of the configuration space. This seems to be 
the mathematical meaning of spin. 


6. PARTICLES 


We shall now use the analysis given in Sections 1 through 5 to develop 
some of the formulas and expressions which are at the heart of non- 
relativistic quantum mechanics, namely those dealing with the mechanics 
of one particle. We shall, however, not go into many details. In the next 
chapter we shall make a deeper examination of the relativistic nature of a 
particle. Our present remarks are thus intended to be brief and serve 
only the purpose of a formal description of a few standard facts of 
quantum mechanics. 

We shall first examine the usual description of a “free” particle. 
Clearly one would like to associate with a free particle a configuration 
space M and therefore give meaning to statements about the position 
of the particle. At the same time, the basic nature of such a system 
requires a covariance with respect to the whole group of motions of M. 
We assume that M is a three-dimensional Euclidean affine space. Then the 
results of Section 5 may be applied to yield the result that these systems 
are labeled by the spin parameter j. To specify the system completely we 
must determine also its Hamiltonian. Now the Hamiltonian H cannot be 
an arbitrary self-adjoint operator since one would like our system to 
possess the usual rules of conservation of momenta. According to our 
principles (cf. Section 1), the requirement that the momenta be integrals 
of motion leads at once to the condition that the dynamical operators 
exp(—itH) commute with the operators V, which correspond to the 
various elements g* of G* (cf. (13), (14)). 

Let us now determine the most general self-adjoint operator H such 
that exp(—7tH) commutes with all the operators of the representation V. 
We consider the case when the spin is 0. We choose a point 7) € M and a 
Euclidean coordinate system with xo as origin. We can then identify M 
with R? and the Hilbert space of the system with Y*(R°). If (g*,v)eG* 
and ye ¥?(R>), 


(Voge, w¥)(x) = ¥(A(g*)(x—v)) (xe R?). 


Let F: y -> ý be the Fourier-Plancherel transformation of 2?(R*) onto 
itself (cf. (41)). Then a straightforward calculation shows that 


(75) (FV gs wyF px) = exp[—1t¢x,v>]- p(8(g*) x). 
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Let K= FHF -+. Then the fact that K commutes with all the operators 
FVat implies that K is the operator of multiplication by some 
Borel function k (possibly unbounded). If we now use the condition that 
K commutes with all the operators F V oF 1, we can conclude that 
for each rotation g, k(x)=k(g~*x) for almost all x. From this it follows 
quite simply that k must coincide almost everywhere with a Borel function 
of x? +22 +232. We may therefore conclude at once that H must be a 
Borel function, in the sense of the functional calculus of spectral theory, 
of the Laplacian, V: 


3? 3? 0? 


A similar argument shows that, conversely, for any Borel function k,(V), 
the operators exp[—itk,(V)] commute with all the operators of the 
representation V, so that in the system with Hamiltonian k,(V), all the 
linear and angular momenta will be conserved. 

Covariance with respect to the group of motions of M cannot therefore 
pinpoint the Hamiltonian H of a free particle other than up to a function 
of V. The usual form of H can be obtained only after we introduce the 
notion of relativity. If we demand that the system be covariant (in a 
suitable sense) under the full space-time Galilean group, then one can 
show that the only possible Hamiltonians are of the form 
(77) H = -L V 

2m ” 
where m is a real number; we choose m >Q so that the spectrum of H is 
bounded below (positivity of energy). We shall not go through the 
calculations here; we shall prove (77) in the next chapter, where we take 
care of the case of arbitrary spin also. 

The formula (77) is truly remarkable; for, if we notice that the operator 
p,= —1(0/0xz,) corresponds to the linear momentum along the 7z,-axis, 
then (77) can be rewritten as 


l 
(78) H = om (P1? + po? +p”). 


Formula (78) is also true if we interpret both sides classically; H is then the 
well known Hamiltonian for a free particle of mass m. Thus equation (78) 
further deepens the formal analogy between classical and quantum 
mechanics. 

If we are dealing not with a free particle but one moving in a potential 
field, then it is natural to assume that the expressions for the momentum 
and position observables retain their validity. But the expression for 
energy will now have to be different. It is natural to retain (78) for the 
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kinetic energy of the particle and add to it a term involving the potential 
energy. Now the potential energy must be regarded as a function of the 
position of the particle only and hence must be represented by an operator 
which is a function of q1, q2, and q3. We thus arrive at the expression of the 
Hamiltonian for a particle moving in a potential field: 


l 
(79) Hs; = om (P? + po? + ps”) +8(q1,90,9s), 


where S(q,,92,93) is the operator of multiplication by the function S on R°; 
the operator S represents the potential energy of the particle. Strictly 
speaking, one must be rather careful about (79), since we are adding to 
the cperator (1/2m)(p,7+ 927+ 37) another self-adjoint operator of 
possibly unbounded character. However, in a large number of physically 
interesting cases there is no ambiguity about the definition of Hs, as a 
self-adjoint operator. The formula (79) would then complete the description 
of the one-particle systems. 

Consider now a special case when S is a C® function and of a sufficiently 
simple type so that Hg, is essentially self-adjoint on the space of C°” 
functions with compact supports. If we now use the Heisenberg picture 
and compute the time derivatives of the p,’s and q,’s we find, from (79) 
and (14), on C,*(R°), 


Ii 


Dr = oH sqr, 


(80) 
Å, = ƏH ,/Op,, 


where 0H,/0q, is the operator of multiplication by @S/dq,, and ƏHşləðp, 
is the operator (1/m)p,. These are the analogues of the classical Hamilton- 
ian equations in their operatorial form. They formed the point of departure 
for Heisenberg’s classic approach to the problems of the atomic systems. 
On the other hand, (79) was the starting point of Schrédinger’s investiga- 
tions. These two approaches, together with the statistical interpretation 
of Max Born (cf. (39)), were the beginning of the epoch which led to the 
modern revolution in physics (cf. Wey] [1]). 

We conclude with two remarks bearing on our special assumptions on 
the configuration space. Our main assumption is that M is a Euclidean 
affine space. This in fact was decisive since it determined the structure of 
its group of motions and cleared the way for the application of the main 
theorems dealing with systems of imprimitivity. However, there are other 
interesting classes of spaces on which the motions act transitively; for 
example, the Riemannian manifolds with constant negative curvature 
(cf. Helgason [1]). In this case, it would be of interest to carry out an 
analysis analogous to what we have done, at least when the curvature is 
small. In particular, it would be of interest to compute the spectra of 


KINEMATICS AND DYNAMICS 315 


various important Hamiltonians as functions of the curvature e, for small 
e<Q. 

Our second remark concerns the existence of configuration space. This 
led to our being able to calculate, for any state of our system, the 
probability that a particle is found in a specified region of the space M. 
However, there are interesting physical arguments which prohibit, for 
certain particles like the photon, the possibility of experimentally verify- 
ing whether they are in a specified region of space. Such particles cannot be 
described, even kinematically, by the considerations of this chapter. We 
shall examine these questions from a different and more unified point of 
view in the next chapter. 


NOTES ON CHAPTER VIII 


1. Quantization. Roughly speaking, Quantization refers to a process by 
which one establishes a “correspondence” between certain classical and 
quantum systems. The mathematical requirements of this correspondence 
are, however, capable of being formulated in many ways and so there are 
several approaches to quantization. Historically it was Niels Bohr who first 
understood the importance, and even the necessity of constructing such 
correspondences. His celebrated ‘‘correspondence principle” was based on 
his discovery that for electron orbits with “large” quantum numbers, his 
new frequency energy relation approximated quite closely the classical 
relation; here large means large in relation to 2. Thus it became possible to 
view Quantum Mechanics as a generalization of classical mechanics whose 
results were approximated by those furnished by classical theories when fi, 
now treated as a parameter, tends to 0. With the discovery of Quantum 
Mechanics by Heisenberg, Schrédinger, and Dirac, the relationship between 
classical and quantum theories became clearer. In the Schrédinger repre- 
sentation, the classical state space is the phase space R?” = R” x R” while 
the quantum Hilbert space is L?(R"), and one “associates”? to classical 
observables which are functions of q1,.--,Qn. Py+--.Pa, the quantum 
observables obtained by the correspondence 


g;>Q = multiplication by q;, 
pO 
p;>—i ry = P, 


However, since the P, and Q; do not commute, such correspondences are 
not precisely defined even at the formal level, and so one needs additional 
prescriptions. The first rigorous quantization appears to be that of Hermann 
Weyl [1] (pp. 272-280) who associated to the classical function 


a= alqi» s. ln Pis ees Pn) 
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the operator A where 


A= fac, see Uns Vis eee Un) 
x exp{ih (u, P + -+U Pn HORI +. +nQn)} dur.. dundo... dVn, 


where å is the inverse Fourier transform of a and the exponential is inter- 
preted via the unitary representation of the Heisenberg group. Since then 
this theme has been examined repeatedly and from many other points of 
view. It is natural to replace the classical phase space R?” by a symplectic 
manifold of dimension 2n, or at least by the cotangent bundle of a “ con- 
figuration manifold ” of dimension n. The questions involve the differential 
geometry and cohomology of these manifolds as well as the analytical 
aspects of symplectic geometry and Fourier transforms. The following is a 
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2. Second Quantization. This is a framework for treating systems of 
identical particles from the quantum mechanical point of view and studying 
their behavior when the number of particles is large. It goes back to 
P. A. M. Dirac’s famous paper on radiation theory (Proc. Roy. Soc. London, 
A, 114 (1927), pp. 243-265) and is fundamental in quantum statistical physics 
and quantum field theory. If % is the Hilbert space associated with a single 
particle, then second quantization associates to # a Hilbert space # in 
an invariant manner; # will be the space corresponding to an unlimited 
number of particles identical with the original one and will have additional 
structure determined by the existence of certain operators. The key features 
present in # may be described as follows: 
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(a) The operator N=the number of particles. There is a self-adjoint 
operator N with pure discrete spectrum and eigenvalues 0,1,2,...; if H, 
is the.eigenspace for the eigenvalue n, states corresponding to the rays in 
H , are those in which the system has exactly n particles. Also 


H =H. 


The vectors of #,, may be denoted by |n); thus N|n)=n|n). 
(b) Vacuum state. The space #, is one dimensional, 


dim (#,) = 1. 


The state corresponding to this space is called the vacuum. 

(c) Occupation numbers: The operators N(¢). If (¢,);>1 is an orthonormal 
basis for % and (n,);>, are integers >0, there are unit vectors, denoted by 
|(n,)), with the property that for any t, n, particles are in the state g,. The 
n, are the so-called occupation numbers. This is formulated by requiring the 
existence of self-adjoint operators N(g) with pure discrete spectrum 
consisting of 0, 1,..., the number of particles in state p. The N (p;) commute 
among themselves, N (p;)|(n:)} =7,|(n,)), and for any n>0, the |(n;)} with 
nı +... =n form an orthonormal basis for #,. 

(d) Creation and Annihilation operators. For each vector ¢ in the one- 
particle space 4% there is an operator A(p) in #, which “annihilates” one 
particle in state p, and its adjoint A(g)', which “creates” one particle in 
state p. These operators could be unbounded. We require 


A(y)'A(y) = N(¢). 


If (y,) is an orthonormal basis of %, it is clear from the occupation number 
interpretation that <A(9;)tA(gm)' and A(ym)'A(y,)t should both take 
|(n,)) to vectors which are proportional. The simplest way to ensure this 
is to require that A (pı)! A(Pm)! =CA(Pm)! Alqı)"; then c= +1, and one is 
thus led to distinguish between two types of systems: Bose-Einstein (B-E) 
and Fermi—Dirac (F-D). The requirements are (with [4,B]=4B-— BA, 
{A,B}=AB+BA) 


) A(g) antilinear in 9; 
(B-E) (ii) [4(¢),4(%)] = 0; 
(ili) [A(¢),A(p)"] = C9) 1. 
) A(p) antilinear in 9; 
(F-D) (ii) {A(g),A (p)} = 0; 
(iii) {A(),4 ($)} = (f.p) 1. 


(In this brief discussion we do not insist on domain considerations in inter- 
preting these rules.) The corresponding particles are respectively known as 
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Bosons and Fermions. It is a fundamental empirical fact that particles with 
integral spin are Bosons while those with half-integral spin are Fermions. 
(e) Cyclicity of vacuum. It is natural to assume that 


A (p) |0) =0. 
The cyclicity assumption means that the vectors 


Alp) Alpo)... A(Pm)' |0) (pe X) 


span a dense subspace of #. 

(f£) Covariance. There is a strongly continuous unitary representation T 
of the unitary group of # in #, commuting with N. Thus T(U) Æ n =Æ n 
for all n, Ueunitary group of X. 


It is not difficult to show that once the type of the particle is specified 
(that is, a Boson or Fermion), then there is essentially one # associated 
to X. # is called the Fock space; it was first discovered by V. Fock. We 
give a brief description of it. 

Bosons. H=S(XH), the symmetric algebra over X. This is defined 
essentially in the same way as in the case when dim(.%) < œ. More precisely, 
let T(4)=T be the covariant tensor algebra of X, i.e., 


T=(1@OX P(X OX)... = Ð (2), 
m2 
where we take the infinite Hilbert space direct sum. Let J, be the closed 
linear span of all tensors of the form 
U®@(xOy—Y OX) Or, 
where z, ye X, U, VE Dimon X ©” = T°. Then 
S=T/J,. 
dfn 


Asin the finite dimensional case, we have a linear Hilbert space isomorphism 
S x Temm, 
where 7" is the subspace of symmetric tensors: 


Msymm = D (H 2m) symm, 


m20 
S is of course graded, the grading being inherited from T: 
I= O8Im  #n=Sw 
m20 
and the algebraic sum 


S°= F, San 


din m20 
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is an algebra. The operators A(p) and A(y)' are unbounded, they are first 
defined on S° and then extended by formation of closures. On S° they are 
defined by 


A(gp)t: ur vm + 1 pu (“EÑ m) 
Alp) : ur 09) (ue Sp), 


where (p) is “differentiation” along p, defined as the unique derivation 
of S° such that 


Ay) p = (p,p) 1. 
The verification that these definitions lead to a B-E system is straight- 
forward. If U is a unitary operator of %, it lifts naturally to T', and hence 
induces a unitary automorphism I'(U) of S which leaves S®° invariant and 
is an algebra automorphism of it. Of course |0) = 1. 
Fermions. The relations 


A(9)'A(y)+A(p)A(p)* = 1 


show that the A (p) and A()' are bounded operators. For the standard model 
we choose # =A, the exterior algebra of X. It is defined as T'/J* where J, 
is the closed linear span containing all tensors of the form 


U@(zGzt+y@Ox) ©», 


where x, ye X, u, ve T°. A is graded in the obvious way; A,,=#,, is the 
image of X 9m under T —> A. If we write 


A= > A, (algebraic sum), 
m>0 


A? is an algebra. As usual A denotes multiplication in A. The creation and 
annihilation operators are defined by 


A(g)t:urrvm+lerau  (ucAn), 


Alp): wis = 2() (ue An), 


where 0(¢) is the unique endomorphism of A° such that 


(i) 3(p)i =0, 
(i) alp)p' = (p,p) 1, 
(iii) S(p) (Um A Un) = (IP) Um) A Un +(— 1)" Um A Op) Un 
(UmE Am Un EAn). 


If U isa unitary operator of #, T (U) is the corresponding naturally induced 
unitary operator of A; T(U) is an algebra automorphism of A°. Finally 
|0>=1. 
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In the Fock representation for a B-E system choose a real form Ær of 
X and write 


P (9) — A(9) ate Qly) = aor. 


Then we have 


[P(o P= 0, [RRE e) PH = Kop). 


These are analogues of the Heisenberg commutation rules, and reduce to 
them when dim(.%) <œ. The uniqueness of the irreducible representation 
of the Heisenberg commutation rules (in the “ Weyl” or “integrated ” form) 
when dim(.%) < œ makes clear why the B-E system is unique. We can in 
fact use the Schrödinger representation to construct another model for the 
Fock representation. Let #=L*(%,R) (with respect to the Lebesgue 
measure) and let 


P(9) = * a9), V(r) = (9,.) (multiplication by (9,.)). 


Then it is enough to take 


Alg) = P(g) = Alg) = 29 we. 


The vacuum vector is the function 
exp (—4]||[?). 


This type of model can be constructed even when dim(.%)=o. It is, 
however, technically more complicated because there is no translation 
invariant (or translation quasi invariant) measure on “Rp when dim(.%#) = œ, 
and one has to use a more sophisticated type of integration theory, namely 
integration with respect to the isotropic Gaussian weak distribution dg on 
Ag (cf. I. E. Segal in Les Problèmes mathématiques de la théorie quantique des 
champs, CNRS, Paris, 1959, pp. 57-103). 

The identification L?( Xr, dg) x # allows us to “quantize” any classical 
dynamical system whose configuration space is Xr and whose dynamics 
is linear and unitary on X. If (U,) is the classical dynamical group, (I'(U,)) 
is the quantized dynamical group; moreover, the grading #7 = Pn H n leads 
to a particle interpretation. This was how Dirac quantized the electro- 
magnetic field and constructed the quantum theory of radiation, namely 
interaction of light with matter. 

For the Fermion case, we keep the real form Xp as before but now define 


P(g) = A(g)+A(g)',  iQlp) = Alp) — Alp). 
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Then P and Q are linear maps of Xk into the space of bounded self-adjoint 
operators on # satisfying 


P(g)? = Ql} = (p,p),  {P(7),Q(p)} = 0. 


This suggests the introduction of the Clifford algebra, Cliff (Z) associated to 
a real Hilbert space £, which is a complex algebra with unit 1, containing 
£ and generated by it, such that 


w=(uul uef. 
Cliff(2) exists and is essentially uniquely determined by 2. If we put 


Rip +ip) =Q) +P), p, pep, 
and write X for the real Hilbert space underlying X, 


(uwy = Re(u,0)) 


(P,Q) =R 


then 


is a bijection between F-D systems based on % and representations R of 
Cliff (4) such that R(u) is bounded self-adjoint for all u e %. 

For a more systematic study of the framework of second quantization 
the reader may refer to F. A. Berezin, The Method of Second Quantization, 
Academic Press, New York, 1966, and J. M. Cook, The Mathematics of Second 
Quantization, Trans. Amer. Math. Soc., 74 (1953), pp. 222-245. In the 
original article of Dirac referred to earlier, he used this framework to study 
the interaction of an atom with the electromagnetic field and obtained the 
known formulas for probabilities of transitions between the energy levels. 
For a detailed treatment see Dirac, loc. cit., and von Neumann [1] (pp. 254- 
294). Dirac’s paper inaugurated the beginning of an entire era in which a 
major goal was to erect a theory of quantized fields obeying the principle of 
special relativity. To get a picture of what has been done, as well as what 
remains to be done, we refer the reader to the following books and to the 
articles referred to in them: 


J.Schwinger (Ed.), Selected Papers on Quantum Electrodynamics, Dover, 
New York, 1958. 

S.Schweber, An Introduction to Relativisitic Quantum Field Theory, Row, 
Peterson and Co., New York, 1961. 

J.M. Jauch and F. Rohrlich, The Theory of Photons and Electrons, Springer- 
Verlag, Berlin, 1980. 

N. N. Bogoliubov and D. V.Shirkov, Introduction to the Theory of Quantized 
Fields, Wiley, New York, 1980. 

J.Glimm and A. Jaffe, Quantum Physics: A Functional Integral Point of 
View, Springer-Verlag, Berlin, 1981. 


CHAPTER IX 
RELATIVISTIC FREE PARTICLES 


1. RELATIVISTIC INVARIANCE 


The fundamental assumption in Newtonian mechanics is the absolute 
nature of space and time. That our empirical knowledge of the physical 
world points up to the contrary was one of the cornerstones of Einstein’s 
great critique of space and time. The analysis of Einstein and Lorentz of 
the empirical and mathematical nature of the physical phenomena 
established that only space-time, as a four-dimensional manifold, has an 
invariant physical significance, and that its familiar splitting into space 
and time is essentially dependent on the observer. Each observer was 
thus seen to describe only the points of the space-time manifold in his 
vicinity by tour coordinates; the formulas describing the transformation 
of coordinates of the same space-time point by two different observers 
involved smooth functions. In other words, space-time became an ab- 
stract C”? manifold and each observer was seen to describe a neighbor- 
hood of this manifold in terms of local coordinates. Only statements 
which retain their meaning in every coordinate system were asserted to 
have any physical significance, i.e., the global differential geometric state- 
ments. Einstein’s theory of gravitation is, for example, a formulation of 
the theory of gravitational phenomena involving only intrinsic geometric 
objects like connections and tensor fields. 

In problems dealing with the quantum aspects of atomic systems, it is 
usual to assume that gravitational fields have no influence on the physical 
phenomena (cf. Dirac [1], p. 253). As a consequence of this assumption, 
the group of transformations which connect the different observers is 
drastically reduced. We shall make the assumption that the observers are 
what are known as inertial, and that the relevant group of transformations 
is the so-called group of special relativity. We shall now proceed to make a 
few explanatory remarks on these concepts. 

In special relativity it is assumed that each observer is able to establish 
coordinates for the whole of the space-time manifold and, moreover, that 
he uses a rather special coordinate system to achieve this. One makes the 
assumption that the notion of a classical free particle, not acted upon by 
external forces, is meaningful for the observer, and that the trajectories 
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of such a particle are described by linear functions of the time coordinate 
in his coordinate system. It is customary to call such coordinate systems 
and observers inertial. It may be noted that an inertial coordinate system 
is simply one in which the first law of Newton is valid—every particle is at 
rest or continues in its state of uniform motion in a straight line, unless it 
is acted upon by external forces to do otherwise. Given two inertial 
coordinate systems, say O, O’, each of which maps the space-time manifold 
in a one-one fashion onto Rt, a correspondence To,o, is established 
between the points of Rt; (2 ,%1,%2,%3) and (Xp ,%,',%, ,%3') correspond 
under To,o if and only if they represent the same space-time point, in 
O and O’, respectively. 
What can one say about the mapping T'o o,, 


(1) T'o,0° : (X9,Xy,%q,%gy) > (Lo ,Xy" Xo’ Ly’)? 


It is natural to expect 7'9 > to be an inhomogeneous linear transformation; 
for, as both coordinate systems are inertial, the trajectories of a free 
particle must be represented by straight lines in both systems, i.e., T'p 0. 
must map straight lines into straight lines. Hence it must preserve 
coplanarity and parallelism. Consequently, if we pass (through the use of 
homogeneous coordinates) to the associated projective geometry of R5, 
To,o Will induce an automorphism of this projective geometry and is 
therefore described in R5 by a unique linear automorphism (cf. the results 
of Chapter II); back in R4, this would lead to the affine nature of To, o- 
It is roughly along these lines that one is led to the basic hypothesis of 
special relativity: space-time is an affine spacet; each inertial observer 
describes it with an affine coordinate system, and the transformations con- 
necting the coordinates of the same point in two different inertial systems are 
inhomogeneous linear: 


3 
(2) x; = > Qj jC; + Uj. 
j=0 


For three inertial observers O, O’, and O”, we have: 
(3) T' 0,0" = T'9’,0°1'0,0" 
(4) Too = T5,b. 


Clearly the set of all To, o forms a group. This group is called the space- 
time relativity group. It has been obtained explicitly as a group of 
inhomogeneous linear automorphisms of Rt. In the abstract affine 
space-time it gives rise to a group of affine automorphisms. 


T % and xp’ are the time coordinates: x,, x; (j=1, 2, 3) the space coordinates. 
tł Cf. Chapter VIII, Section 4. 
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However, not every inhomogeneous map of the form (2) can arise as a 
transformation between inertial coordinate systems. Further examination 
of the empirical nature of space-time phenomena yields certain restric- 
tions on the To,o. Before Einstein and Lorentz, it used to be assumed 
implicitly that the notion of simultaneity of two events was an invariant 
one, i.e., if O, O' are two inertial coordinate systems, if (£9,%1,%2,%3) and 
(YoY1:Y2Y3) are the points of R* which correspond under O to two space- 
time points, and (a9’,x,'ao',73') and (Yo Y1 Yz2 Y3) represent the same 
pair of points under O’, then x)’=y,’ if and only if xo =y9. It follows from 
this that in the formula (2) for To,o, we must have 


(5) Lo = AgoLy + Uo. 


If we assume that the unit of time measurement and the sense of time 
propagation is the same in both systems, we must have ao)=1, i.e., 


(6) Lo = To + Up. 


Furthermore it was assumed that the spatial distance of pairs of space- 
time points with the same time coordinate also remained unchanged, i.e., 


3 
(x;— yi)? = 2 | tiy) 


From this it follows that the matrix of order 3 x 3 in (2), 


(8) W = (a) <i,js3 


is orthogonal. Consequently, if we write x for the column vector with 
components 21, Xo, and zz, 


Me 


(7) 


i=l 


x’ = Wx+aovt+u, 
(9) 


To 


Xo F Uo, 


where W is an orthogonal 3x3 matrix, v,ueR?, and ueR!. The 
collection of all transformations (%9,x) —> (X9’,x’) of the form (9) is easily 
verified to be a group. 

We now assert that every transformation of the form (9) is a possible 
transformation between two inertial coordinate systems. Since the 
possible transformations constitute a group, it is enough to exhibit a 
class of transformations which actually arises in practice and which also 
generates the group of all transformations of the form (9). If we put in 
(9) wj=0, u=v=0, we obtain a transformation which connects two 
coordinate systems with different coordinate axes but with the same space 
and time origin, and the same sense of time flow, and which are mutually 
at rest; evidently all such transformations are possible. If we put W=1 
and v=0, we obtain transformations which connect two coordinate systems 
with different space and time origin, have the same sense of time flow, 
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have coordinate axes in the same orientation, and are mutually at rest. 
Clearly all such have to be included in the relativity group. Finally, if we 
put W=1, uwy=u=0, we obtain the transformations 


t= LHL, 


(10) , 
To = Xp. 
The formula (10) represents the transformation which connects two 
coordinate systems which have the same time origin and sense of time 
flow, which are moving uniformly with respect to each other with velocities 
v and —v, and whose frames coincide for 7) =x, =0. These also have to be 
admitted. An easy calculation shows that these three classes of trans- 
formations generate the group of all transformations of the form (9). In 
other words, the physical assumptions we have made lead one to the 
group described by (9) as the relativity group. It is called the orthochronous 
inhomogeneous Galilean group, and the corresponding physical principles 
go under the name of Galilean relativity. If we assume det W = +1 in (9), 
we obtain the usual inhomogeneous Galilean group. The transformation 
x’ = X, 
(11) 
Ly = — To, 
represents time inversion. If we add this to the orthochronous group 
determined by (9), we generate a larger group called the complete 
inhomogeneous Galilean group. 

When dealing with bodies and particles that move with very high 
velocities, these principles are violated quite decisively. The well known 
Michelson-Morley experiments led to the remarkable conclusion that in 
every inertial coordinate system, the velocity of propagation of electro- 
magnetic and light signals in vacuum remained constant. It was Einstein 
who embodied this as a basic principle and deduced from it the conse- 
quence that the affine transformation (2) must be such that the same 
numerical value is obtained in both coordinate systems for the velocity 
of these signals. Choosing this numerical value to be the unit of time 
measurement, this means that for corresponding points of R+, the relation 


3 3 


(12) Yo— 20)? -20 y-a)? = (Yo' —%')?— > (yy —2/')? 


j= j= 
must be satisfied. If we now substitute (2) in (12), we find, for the 4x4 
matrix, 
(13) A = (4;)7,;=05 
the equation 
(14) AFA =F, 
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where F is the diagonal matrix: 
(15) F = diag{1,—1,—-1,—1}; 


equivalently, the linear transformation induced in R?! by A preserves the 
Minkowskian form 


(16) G + o — x? — Xo" — 4,7. 


The set of all A satisfying (14) is called the Lorentz group, and the set of 
all affine transformations (2), with A Lorentz is called the complete 
inhomogeneous Lorentz group. It can be shown (cf. Section 2) that 
|@o9| = 1 and that the set of all transformations with ao, > 1 is a subgroup, 
the orthochronous subgroup. Exactly as we showed in the Galilean case, it 
is possible to argue that the entire orthochronous Lorentz group must be 
admitted as the group of special relativity. To prove this, it is again 
sufficient to exhibit a generating class of transformations which actually 
arise. The transformations 


x = Wx+u, 
(17) 


Xo Lot Uo; 


all arise explicitly; they connect inertial coordinate systems which are 
mutually at rest and which differ in their space and time origins and the 
orientations of their spatial coordinate axis but have the same sense of 
time flow. Consider, on the other hand, the transformations 


£o = (1—02) 2x9 + o(1 — v?) 12a, 

£ = v(1 — v?) Yxa + (1 —v?)~ Yr, 
(18) , 

Xo = Vo; 

X3° = Xz, 


for values v with —l1<v<l1. Equation (18) represents, for fixed v, a 
homogeneous Lorentz transformation. The space-time point which 
corresponds to the point (7,0,0,0) in the O’-system is represented by 
(t(l —v?)- 1/2, — rv(1 — v?) 1/2, 0, 0) in the O-system. This shows that (18) 
represents the transformation which connects O and O’ when O is moving 
relative to O’ with uniform velocity v along the z,-axis. Transformations 
of the form (18) must therefore be included in the space-time group. 
By symmetry, the transformations in which x, and 2g play the role of x, 
are also admissible. It can be shown that these three classes of trans- 
formations, together with those defined by (17) and the time reversal 
(11), generate the complete inhomogeneous Lorentz group (cf. Section 2). 
In other words, in the special relativity of Einstein and Lorentz, the 
space-time group in question is the complete inhomogeneous Lorentz 


group. 
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It is usual in physical literature to do all this, not in natural units, but 
in the usual, say, C.G.S. units. If c denotes the velocity of light in vacuum 
as computed by inertial observers, then xọ=ct, where t is the time 
coordinate, and (18) will appear in the form 


1 vje? 


t = —— t+ i, 
V1—v?/e? = V1—v?/c? i 
v 1 
19 v = — tH E), 
a») "Vinee ine 
Ta = Xa; 
£3 = Xz. 


When v/c — 0, this formula becomes 
t’ = ft, 
x, = 2, +12, 
(20) 1 1 
Lo = Ta, 
ty’ = T3, 


which are the formulas expressing the same relation between the two 
frames in Galilean relativity. In other words, we may view the application 
of Galilean relativity as satisfactory when the velocities of the particles 
and bodies involved are small compared to c. For high-speed particles, the 
formulas (19) must be used, i.e., only Lorentz relativity is to be postulated. 

These remarks conclude our extremely brief discussion of special 
relativity. The reader who is interested in a more detailed analysis of 
space and time is referred to Weyl’s treatise [6]. 

The main problems of relativistic particle physics concern the various 
high-speed particles and their interactions. A general theory of such 
phenomena must combine the principles of quantum theory with those of 
special relativity to obtain the relevant equations of motion and to draw 
the necessary physical conclusions. In such generality these problems are 
still unsolved. In this chapter we shall be concerned only with the free 
particles and their equations. Before entering into a discussion we shall 
formulate mathematically the requirement that the physical description 
be relativistically invariant, i.e., the physical content of the systems 
under consideration be the same for all inertial coordinate systems. 

Consider now a system © which we want to describe in an invariant 
manner. We denote by G the space-time relativity group, namely, the 
group of all affine transformations T'o, o (2) which connect pairs of inertial 
systems. Each observer O associates with © a Hilbert space # whose 
rays are identified by him with the dynamical or Heisenberg states (cf. 
Section 1 of Chapter VIII) of G. In the coordinate system O, G is described 
by the determination of the set of all its dynamical states as the rays of 
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Ho. In general, o will vary essentially with O. We shall formulate the 
relativistic invariance of (the description of) G by the requirement that 
Ho be independent of O, i.e., the set of motions of © is the same in all 
inertial coordinate systems. We shall write # for this Hilbert space. Even 
though #@ is the same in all inertial systems, the same physical state of © 
may be described by two entirely different rays of # by two different 
observers, O, O’. Hence we assume the existence of a one-one mapping 
to o of the set of all rays of # onto itself, 


(21) too . r —> 7, 


such that r and to, o'[r] represent (in O and O’) the same state of S. Now, 
the transition probability between the dynamical states is an invariant 
numerical quantity, and hence ty 9 must preserve all the transition 
probabilities. This means that to o is a physical symmetry in the sense of 
Chapter IV, and consequently is induced by a unitary or anti-unitary 
automorphism of # (Wigner’s theorem). We write Do,o, for this sym- 
metry. Clearly, we must have Do,o=1 and Do 9» = DororD 0,07. If we 
write T =T o.o, for the element of G whick connects O and O’, D must 
depend only on T, as otherwise there would be an essential influence of © 
on the coordinate system O. In this manner we obtain a representation 
T —> D, of the group G into the group of symmetries of # such that 
to,o is induced by Dy, o for all pairs O, O’ of inertial coordinate systems. 
If G is a Lie group and G° is the component of identity of G, then we may 
assume that T —> Dr is a homomorphism of G° into the projective group 
of # (cf. Wigner [1], [4] for a detailed discussion). 

We thus see that associated with every relativistically invariant system 
© there is a pair (#,U) where # is the Hilbert space whose rays represent 
the dynamical states of GS, and U is a (Borel) homomorphism of the 
connected component of the relativity group into the projective group of 
HA. We can replace U by a projective representation; any projective 
representation which induces the same homomorphism in the projective 
group may be used to study the behavior of the states under the relativity 
transformations. The physical interpretation of the representation will be 
along the same lines as in our analysis of systems covariant under con- 
figuration symmetries. We shall, of course, apply this only for the free 
particles, for which the interpretation is quite convincing. 

It is easy to see how one may obtain the temporal evolution of © from 
the representation U. To see this let us fix an inertial coordinate system O. 
We may then treat the rays of the Hilbert space # as describing the 
states of ©, the state at time t=0 being identified with the same ray 
which describes the dynamical state. Consider now, for any real number a, 
the observer O’ whose coordinate system is connected to that of O by the 


transformation 7'_.,, , 
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where 
(22) Lo = %y—-a, Ly = 4; (l <j < 3). 
Then the map 
(23) a> Ur (© < @ < œ) 


gives the dynamical group of the system S. 

We have already seen that the group R! has only exact multipliers and 
so we can replace (23) by a representation a > V,=exp(—iaH). In view 
of our analysis in Chapter VIII it is natural to assume that H represents 
the total energy of the system. If U itself is a representation, then, 


(24) Ur, = exp(iaH). 


Equation (24) can be generalized to describe the total linear momenta. 
Let ae R‘, let a = (apa), and let T, be the transformation x > x’, where 


Xo = To— do, 
x’ = x—a. 

Then there is a unitary representation s > W, of R! such that s > W, 
induces the same homomorphism into the projective group as s —> Ur, 
where a=(0,a). We write W,=exp(tsB,) and call B, the total linear 
momentum in the direction a. We may, of course, consider all the space- 
time translations at the same time by considering the restriction of U to 
the group of all space-time translations T. If we assume for the moment 
that the multipliers of the relativity group have exact restrictions to the 
translation subgroup, we can obtain a representation a —> W, (a=(do,a)) 
which can replace the restriction of U to the translation subgroup. By 
the general theory of representations of abelian groups, there exists a 
unique projection valued measure Q on the dual Pt of Rt such that 


(25) Wa = [exp ika.p>dQ(p) 


(<...) is the duality between R‘ and P4). The spectrum of Q is called the 
spectrum of the system. Notice that the energy and the momentum operators 
commute with one another. This leads, in the usual fashion (cf. Chapter 
VIII) to the law of conservation of the linear momenta. 

It is clear that those forms of the representation U in which Q appears 
in its canonical form will be especially simple to analyze mathematically. 
Such descriptions are called momentum space representations. 

Experiments involving the violation of parity have shown that one does 
not always have invariance of the physical content of theories with respect 
to the Lorentz transformations which do not belong to the connected 
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component of the identity of Œ. Also it is well to remember that only ele- 
ments of the connected component of the identity of G are represented by 
unitary operators. For the other elements it is necessary to use both uni- 
tary and anti-unitary operators. We shall examine this question later on. 

We shall call a system elementary if the associated projective representa- 
tion of the connected component of the relativity group is irreducible. In 
intuitive physical terms we can describe this by saying that starting from 
any state we may obtain all the states by Lorentz transformations and 
the principle of superposition. Two elementary systems may be said to be 
physically equivalent if there exists a unitary or anti-unitary isomorphism 
of the underlying Hilbert spaces which intertwines the associated pro- 
jective representations. An elementary particle in tts free state or a free 
particle is defined to be a physical equivalence class of elementary systems. 
One of the main results of this chapter is the classification of the elementary 
particles in terms of their mass and spin. 

In classical and also in nonrelativistic quantum mechanics, the free 
particles do not play a very important role. However, the knowledge of the 
structure of the transformation laws for the states of free particles is very 
basic in most problems of particle theory. The reason for this is, roughly 
speaking, that most of the experiments involve scattering of various kinds 
of particles, and the incoming and outgoing are clearly those of free 
particles. It is because of this that a study of the transformation laws of 
free particles is quite fundamental. 


2. THE LORENTZ GROUP 


In this section we collect a number of facts involving the Lorentz group. 
The notations that will be used throughout this chapter are also introduced 
in this section. 

We begin with the homogeneous Lorentz group, denoted by ZZ. It is 
the group of all 4x 4 matrices ~ 


(26) L = (ty)ij=0 

such that 

(27) LFL = F, 

where ¢ denotes matrix transposition, and F is given by 
(28) F = diag{1,—1,—1,—1}. 


We write €o, €41, €2, €3 for the usual basis vectors so that (2%,%1,%,_,%3) = 
Lalo H L181 HL + Lgl. Each L induces a linear transformation of R* and 
(27) is the condition that it preserve the quadratic form: 


(29) G : Xo? — 241? — 3 — r32. 
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The group # is a closed subgroup of the group of all invertible linear 
transformations of R$, and as such is a lesc group, even a Lie group. We 
shall examine its topology. Consider an L e H. In view of (27), 


(30) det L = +1. 


Further, L is an orthogonal matrix if and only if L preserves both the 
forms x,7+227+2,7 and x2. Hence such an L is of the form 


+1 00 0 
(31) t=-{ ° 

0 W 

0 


where W is a 3 x3 orthogonal matrix. We consider next a positive definite 
symmetric matrix L. Let L"? denote the unique positive definite sym- 
metric matrix whose square is L. We contend that if L is in A, so is L”2. 
In fact, from the equation LFL=F we obtain (L"/?)?=(FL-?F)?2, 
which shows, in view of the uniqueness of Z1/?, that ZV? = F(L1/?)-1F 
or that L2 e Ĥ. By a trivial induction we conclude that L™2™ e Ĥ for all 
integers m and n. From this observation we can determine the structure 
of the set. of all positive definite Lorentz matrices. Consider, to this end, 
the mapping 
S —> eS 


of the vector space of 4x4 real symmetric matrices into the space of 
symmetric positive definite matrices. It is well known (cf. Chevalley [1], 
pp. 14-15) that this is a homeomorphism. From what we saw just now, it 
can be asserted that whenever eS e Ñ, e™!2™s e A for all m and n and hence 
that e*S e Ĥ for all real a. If we differentiate the equation 


(e25) F(e%S) = F 
with respect to a, at a=0, we obtain 
(32) SF+ FS = 0 (S symmetric). 
Conversely, suppose that S is a 4x4 symmetric matrix satisfying (32). 
If we write g(a) =e% Fe", we find 


dp — pas aS _. 

Ta (a) = e(SF + FS)e* = 0 

for all a, proving that p(a) is a constant matrix; the equation ¢(0) =9(1) 
tells us that eS e H. In other words, we have proved that for a 4x4 


symmetric matrix S, e5 e A if and only if S satisfies (32). The set of all S 
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which satisfy (32) is a three-dimensional linear space p with a basis given 


0 1 0 0 00 1 0 
u 0 
01 — 0 0 ’ Moz — l 0 ’ 
0 0 
(33) 
000 1 
Mo = 0 0 
1 
If we write 
(34) H? = {L: LeH, L positive definite symmetric}, 


then we may conclude that H? is homeomorphic to the three-dimensional 
Euclidean space; S — es is a homeomorphism of p onto H”. In particular, 
1 e H? and H? is connected so that H” c< H, where H is the component of 
the identity of H. Consider now an arbitrary L e H. Since F = F~}, the 
equation D-/F(L')-1=F shows that (L7!)'eH and hence Lt itself 
belongs to H, i.e., 


(35) At = dH. 


We now examine the polar decomposition of L, i.e., L= L,(L'L)'/*, where L, 
is orthogonal. From (35) and our earlier discussion we know that (LEL)? € H? 
and hence L, eH. Hence L, must be of the form (31). In other words, every 
L e Ĥ can be uniquely written as 


1 0 0 0 


(36) L=e 


where W e SO(3), e= +1, and e’-= +1, and Se}, i.e., S is symmetric and 
satisfies (32). Since the 0-Oth entry of eS is seen to be =1 by direct com- 
putation, it follows from (36) that for L, 


Qo! = 1, 
(37) ool 

Eloo > Q. 
Moreover, det L =e’? so that 


(38) e'(det L) = 1. 
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Thus e=1 and e’=1 if and only if aj9,>1 and det L= +1. Collecting all 
these facts, we have 


Theorem 9.1. The component H of the identity of H is given by 


(39) H = {L : a% > 0, det L = +1}. 
The map 
1000 
0 
W,S —> es 
0 W 
0 


is a homeomorphism of SO(3) x p onto H. For any Lorentz L, |doo| > 1 and 
det L= +1. A has, besides H, exactly three connected components and these 
are the H-cosets of H determined by the three elements I,, I,, and I, where 
I,, I, and I, are obtained from (36) by putting S=0, W=1, and for e and e’ 
the respective pairs of values (1,— 1), (—1,— 1), and (—1,1). 


We write K for the subgroup of H corresponding to SO(3). 


The Lie Algebra. Since H is a linear group, we shall canonically identify 
its Lie algebra ) with a Lie subalgebra of matrices or endomorphisms of 
Rt, the bracket being the usual commutator A, B —>[4,B]= AB — BA. 
The exponential map then becomes the usual one, S—>e’. A matrix 
S € h if and only if e*’ e H for all real a. Arguing as in (32), we find that 


(40) h = {S : SF + FS = 0}. 


fh is six-dimensional. If we write f for the set of matrices 


(41) f={S:Seh, Se, = 0}, 
then the elements S of f are precisely those of the form 
00 0 0 
(42) S = ° ; 
0 Sy 
0 
where S, is skew symmetric: 
S = —-8, 


t is the subalgebra of ) corresponding to SO(3) imbedded in Ë. § is the 
direct sum of t and p: H=f+p, EA p=0. 
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A useful basis for ) may be constructed in the following manner. Let 
e; denote the 4 x 4 matrix whose only nonzero entry is the 2-jth, which is 
1 (2,7=0, 1, 2, 3). Let 


(43) M; = -Ma = Cig — ĉji (2,9 = l, 2, 3t # 7), 
(44) Mo = Mai = eoit eio ( = 1, 2, 3). 


Then Mia, Ma3, Mg, Mois Moo, and Mog span h. The commutation rules 
are given by: 


[MM a] = Mik (i # J #k# 1, i, J, k = 1,2, 3), 
(45) [MoMo] = Mi; (ÆJ, i,j = 1,2,3), 
[Moi M 5] = 0 (2 FJ F k + i, t, j, k = l, 2, 3), 


[MM oi] = -Mo (i £ ds 1,9 = l, 2, 3). 


The Covering Group. We shall introduce in an explicit fashion the 
universal covering group of H. We consider the real four-dimensional 
space of 2x 2 Hermitian complex matrices and the linear isomorphism of 
R! with it given by 


(46) A : (Xoti £23) —> Toso + X101 + T203 + T303, 


where ay is the unit matrix, and o,, o,, and c, are as in (55) of Chapter VI. 
The trace of this matrix is 2x and its determinant is xo? —x,? — £3 — x3”. 
Consider now the group 


(47) H* = SL(2,0), 
and for any m e H* the linear transformation 
(48) 5(m): E —> mém* 


of the space of 2 x 2 Hermitian matrices. In view of (46), 5(m) induces a 
linear transformation in R4. Let us also write 6(m) for this transformation. 
From (48) we see that m —> 6(m) is a homomorphism of H* into the 
group of invertible linear transformations of R‘. Since det m=1, det = 
det mém*, so that the transformation 6(m) must preserve the quadratic 
form z? — z1? — 22 — x32. Thus 8(m)¢ H and m-> 8(m) is a continuous 
homomorphism of H* into 7, hence into H. From (48) it follows easily 
that 


(49) kernel (ô) = {+1}. 


ô is thus a local isomorphism and therefore its differential 5 induces an 
isomorphism of the Lie algebra of H* into that of H. But the Lie algebra 
of H*, being canonically isomorphic to the Lie algebra of 2 x 2 complex 
matrices of trace zero, is, as a real vector space, six-dimensional. Conse- 
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quently, $ is surjective. From this we infer at once that 5 maps H* onto 
the connected Lorentz group H: 


(50) S[H*] = H. 


The topology of H* may be analyzed in the same way as the topology 
of H. The technical details are similar. Using polar decomposition of 
elements of H*, we conclude that H* is homeomorphic to the Cartesian 
product of K*, its unitary subgroup, and the space of positive definite 
complex Hermitian matrices of determinant 1. Since S + e5 sets up a 
homeomorphism of the latter space with the three-dimensional real vector 
space of 2x2 Hermitian matrices of trace 0, we find that H* is homeo- 
morphic to K*xR*. In particular, H* is simply connected. ô is thus a 
covering homomorphism of H* onto H. We note that for an me H*, 
5(m) is orthogonal if and only if the map  —> mém* preserves tr(&) as 
well as det(&). This can happen if and only if m is unitary. Thus 


(51) SiK*] = K,  K* = 8-1(K). 


The Lie algebra representation 5 corresponding to 8 is easily computed. 
Let )* be the Lie algebra of 2 x 2 complex matrices of trace 0, 


(52) ht = {X :trX = 0, X complex}. 
Then 
(53) 8(X)é = XE+ EX*. 


We take the basis {t0,,202,103,0,,09,03} for }*. An easy calculation gives: 


§(io,) = 2M 93, 8(iae) = 2M 3), §(to5) = 2M 2, 


(54) 
d(o,) = 2M 91, (9) = 2M o2, 3(a3) = 2M o3. 


From (54) we see that 6 maps Hermitian elements of h* into symmetric 
elements of ). Hence 5 maps the set of positive definite elements of H* 
onto H”. Further, for a unitary ke K*, ô(k)e K. Hence, for a general 
element m of H* of the form m=kp, k unitary, p positive-definite Hermi- 
tian, we have 8(m*) = 6(p)8(k~!) = 8(p)6(k)' = (8(k)8(p))' so that 


(55) d(m*) = lm)! (m e H*). 
On the other hand, let ¢ be the reflection in &* given by 
(56) C : LoL1 T223 —> Loi, —Xq,2%3. 


¢ corresponds under the map (46), to the conjugation £ —> £ of Hermitian 
matrices. Since mém* =(mém*)~, we have 

(57) 6(m) = [d(m)é. 

Combining (55) and (57), we find 

(58) 8(m') = £8(m')e. 
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Representations of SL(2,C) in Finite Dimensional Vector Spaces. The 
group H* has no finite dimensional unitary representations other than 
the trivial representation, but it has a number of nonunitary representa- 
tions in finite dimensional vector spaces. We shall now describe these. 

We write W =C? and W! for the subspace of symmetric tensors of rank 
2j (j=0, 4, 1,---) in the tensor product W @ W @---® W (2) factors). 
The representation m —— m Q m @---@ m leaves W’ invariant. We write 
Di for the representation of H* defined by W’. D’ is already irreducible 
when restricted to the unitary group A*. If {e,,e,} is the standard basis for 
W, the matrix entries of the representation of H* in W @---® W, 
relative to the basis {e,, Q-Q e,,} (N =2j), are 
(59) f(r: tts Ty, Sitt t’ Sys m) = Arsi’ *Arysy> 
where m e H* is the matrix (a,,)2,-,. The f’s are complex analytic 
functions on H*. D’ is thus a complex analytic representation of H* whose 


restriction to K* is equivalent to the irreducible representation of K* 
associated with j. 

For any j, we may consider the representation Di conjugate to D’, i.e., 
we take a basis for W’ and conjugate the matrix representation obtained. 
Di is also an irreducible representation of H*. Its restriction to K* is 
once again equivalent to the irreducible representations of A* corre- 
sponding to j. No D* is equivalent to any D’ except when k = j = 0. 

It is known that for arbitrary j, 7’ (j, 7’ =9, $, 1,---) the representations 
(60) Di : DI @ D 


are all irreducible and that these exhaust, up to equivalence, the irreducible 
finite dimensional representations of H*. For a proof the reader may refer 
to Weyl ([2], pp. 267-268). 


The Inhomogeneous Group. The Lorentz group H acts in Rt as a linear 
transformation group and hence one may form the associated inhomo- 


geneous group, 
(61) G=Hx’R4 


where we use the notation x’ to denote the formation of the semidirect 
product. For (L,x) and (L’,x’) belonging to G, 


(L,x)(L',x’) = (LL, Lx’ +2). 
The group H* may also be considered as acting on R* through the action 
(62) mx = 6(m)x (m e H*, x e R$). 
We write G* for the associated semidirect product 


(63) G* = H* x' Ri, 
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The subgroup of @ given by 
(64) G = Hx’ Rt 
is the component of the identity in G, and the map 
(65) 5:m,x—> d(m), x 


is the covering homomorphism of G* onto G; G* is obviously simply 
connected. 

The group G is generated by G and the “inversions” (J,,0), (J,,0), and 
(I,,,0), where I, is space inversion, I, is time inversion, and Iy is space- 
time inversion: writing (%9,X) for peo +211 + L262 + L363 

I, : (%,X) —> (£o, — X), 
(66) I, : (%q;X) > (—2p,X), 

Ly ' (%o,X) —> (— zo — X). 
The elements J, (r=0, 8, t, st; Io is the identity) form an abelian group 
having four elements, the group of inversions, which is denoted by H,,,: 


(67) H inv = {Mols LoLs. 


The group H;,, is a discrete subgroup of the homogeneous Lorentz group 
H, and Ñ is the semidirect product of H,,, and H: 


(68) H = A, Af, H A H inv = {1}; 


the action of H,,, on H is of course through inner automorphisms. 

For any 7, L >J,LIJ,~* is an involutive automorphism of H, and the 
assignment which sends 7 to this automorphism is a homomorphism. As 
H* is the universal covering group of H, there exists for each r, a unique 
automorphism A —> h, of H* such that 


(69) (h) = I,8(h)I, ~? 


for all h. h —> h, is involutive and the assignment s, which sends r to the 
automorphism h —> h,, is necessarily a homomorphism. We may thus form 
the semidirect product 


(70) A* = Hw X, H*. 
We continue to write ô for the map 
IL, h — 8(h)l,; 


ô is a homomorphism of H,,, x, H* onto Å. 
The complete inhomogeneous group Ĝ is the semidirect product of G 
and Hinay; each J, gives rise to the automorphism 


(L,x) > (1,01,,1,2) 
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of G. The corresponding map 
(ha) > (h,,1,2) 


is seen to be an automorphism of G*, and the assignment s, which sends r 
to this automorphism, is once again a homomorphism. We may thus form 


the semidirect product G*, 

G* = Hiny X;,G*. 
The group G* has a natural homomorphism ô onto G: 
(71) ò : IL, h, x — (lh) x). 


It is useful to determine explicitly the automorphisms h —> h, For 
c=st, I,,D1,~1= L so that 


(72) ha = h. 


Suppose now that r =t so that J, is time inversion. It is clear that I, = — F, 
F being the matrix of the fundamental form. Hence 8(h~1*)=(8(h)~ 1)! 
(cf. (55)) =Fê(h)F so that 


(73) h = h-1*, 
As I,J,=I, we have 
(74) he = hy. 


Orbits in Momentum Space. The space Rt is a locally compact abelian 
group under addition and hence possesses a dual group. We identify this 
group with the vector space dual of Rt. To keep things explicitly separated 
we write Pt for the vector space dual of R*. Elements of P+ will be written 


(75) P = (Po-Pi: P2P3) = (PoP), 
and we shall define the canonical duality between Rt and P* by 
(76) {v,p} = ToPo — T Pı — T2P2— T3 Pz. 
We associate with each peP‘ the character 
P : x —> exp i{x,p} 


of R‘. The mapping p> maps Pi isomorphically onto the character 
group of Rt. We thus identify Pt with the character group of R4. 

Since each element L of H isa matrix, it acts in Rê as well as in P*. 
In view of the definition of {.,.} and f, we have: 


(77) {Lax, Lp} = {x,p}, 
(78) p(L-*x) = (Lp)~(2). 
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We now compare (77) and (78) with (127) of Chapter VI, and conclude 
that p -> Lp gives the adjoint action of L on P*. It is customary to refer 
to Pt as momentum space. 

The group Hf acts on P* and converts P* into a H-space. Pt then breaks 
up into H-orbits and it is necessary to compute these. It is simplest to 
deal with the H-orbits. We introduce the following subsets of P*: 


Xm* = {P : Po? — P? — P? — p3? = mM”, po > 0}, 


(79) 
Xm = {P : P? — P? — po? — p3? = mM, Py < 0}, 


where m is real and > 0. We write 
Xoo = {0} . 
Also, for any m>0, we write 


Yin = {P : P? — Pp? — Pp? — Ps” = —m’*}. 


Xn,* UX,,~ and Y,, are obviously invariant under the whole homo- 
geneous Lorentz group H, in particular, under H. We claim that X,,* 
and X,,~ (m20) are invariant under H. If m>=0, the consideration of the 
mapping (P1, PaPa) > + (p1? +P2?+Ps2)"” shows that X,* and Xp” are 
the connected components of X,,* U X,,~. Since H is connected, it must 
leave each of them invariant. For m>0, X,,* and X,,~ are homeomorphic 
to R?; for m=0, to R°—{(0,0,0)}. Thus X,,+ (m>0), Ym» (m>0), and 
Xoo are all H-invariant, and P‘ is the union of these. 

We shall next prove that H acts transitively on each of these. For real 
x, y let 


y « 0 0 

x 0 0 
(80) Liy= |.” 

0 0 1 0 

0 0 0 1 


If y>0 and y?—2z?=1, it is clear that L, „ belongs to H. Consider now 
X,* with m>0 and a point p on it. The point (m,0,0,0) lies on X,,*. 
Choose now x=(p — m?) ?/m, Yy=Pə/m in (80); since po>O and 
Po -M =p;? +p? + pz", x and y are.real and y>0. Then L= L}, sends 
(m,0,0,0) to a point of the form (7o,7,',0,0). Now p,'*=p,7+ p27 + p3? 
so that we can find a rotation R e K sending (P9, P1',0,0) to (9,21, Pe; P3). 
This shows that RL sends (m,0,0,0) to p. H is thus transitive on X,,*. 
For X,,~ we consider (—m,0,0,0) and use a similar argument. Consider 
next Xot and a point pe Xq*. (1,1,0,0) Ee X,t and, if we choose x= 
(Po27—1)/2p5 and y=(po2+1)/2p, in (80), L= Lz. sends (1,1,0,0) into 
(Po,Po,0,0). Once again, there is a rotation Re K such that RL sends 
(1,1,0,0) to p. A similar argument, using (—1,—1,0,0) instead of (1,1,0,0) 
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establishes the transitivity of H on X,)~. The transitivity of H on Xo 
is trivial. 

We now take up the Y,, (m>0). Let p= (Po Pi: P2:P3) € Ym; Po need 
not be >0. If we choose x=po/m and y=(po7+m?)4/m, then L= 
L,,., € H and sends the point (0,m,0,0) of Ym into (po,my,0,0); as m?y? = 
Pı? +po7+ ps5", there is a rotation R e K such that RL sends (0,m,0,0) to 
p. This proves that Y,, is transitively acted on by H. 

The stability subgroup of H at (m,0,0,0) or (—m,0,0,0) (m>0) is the 
rotation subgroup K of H. The stability subgroup at (0,m,0,0) is the 
subgroup of H consisting of all matrices L of the form 


doo YO Age Ags 
0 0 


l 
doo O dzz zg 
0 


where 
Aon > O 


and the 3x3 matrix (a,,); ;=0,2,3 has determinant 1 and preserves the 
form 

G’ 3 Xo? — Xo? — 5". 
In our usual notation (cf. Chapter V), we denote these by H...,, oo) 
and Ho m.o,0). Lhe first two are compact and connected. The last is not 


compact. All are semisimple. For the action of the covering group H* on 
Pt, the orbits are the same. For the stability groups, we have: 


(31) At. m,0,0,0) = K*. 


For X,* the nature of the stability subgroups changes radically. It is 
most convenient to construct these corresponding to the two points 
(+1,0,0,+1). We consider X,* and the pre-image in H* of the stability 
group at (1,0,0,1): 


(82) Hž 0,0,9 = 987 *(Ha,0,0,1): 
Using (46) we find that m € HŠ 0 0,1) if and only if 
83 mf ym _ k o) 
(83) 00)  \o0 


HŠ 0,0,1) ì8 thus the group E* of all matrices m of the form 


(84) m, a = 6 | (a,z€C, |z| = 1). 
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{m;a : a E C} is a normal subgroup isomorphic to the plane, {m, o : {z|=1} 
is isomorphic to T, and £* is the semidirect product of these two where 
the action of z on a is given by a—>2z?a. Therefore H*/{+1} (1 in the 
denominator is the 2x2 unit matrix) is isomorphic canonically to the 
Euclidean group in the plane. In other words Ha,o,0, is isomorphic to 
the group of Euclidean motions of the plane R?. The groups H* 9 9 1, and 
H43.0,0,1) are not semisimple. They are, in fact, solvable. The calculations 
are similar for the case of X,.~; the stability subgroups are the same. 

We note that our argument showing that H is transitive on X,,* 
established something more. It actually showed that given any p € X,,* 
there is a rotation Re K and a Lorentz transformation Ly , (z,y>0, 
ef. (80)), such that RL, , sends (m,0,0,0) to p. As the stability subgroup 
at (m,0,0,0) is K, we conclude from this that any L e H can be written as 


(85) L = RZS, 


where R and S are rotations, and Z=L,, for suitable x>0, y>0 with 
y*—2z*=1. If we define 


(86) v = aly, 

then Ly, , becomes the transformation connecting two inertial frames, the 
second of which is moving uniformly with respect to the first with velocity 
v (cf. (18)). This shows that the general Lorentz transformation is a 
product of the rotations and transformations to moving frames of the 
form (18). We have already used this fact in Section 1. 


The Invariant Measures on the Orbits. All the H-orbits admit invariant 
Borel measures. This is obvious for X,,* and the Y,, (m>0Q) because both 
the group H and the stability groups are semisimple and hence unimodular. 
For X,+ we note that the Euclidean group is unimodular so that each of 
X,* also admits an invariant measure. We shall now calculate these 
measures explicitly. 

Choose a real number a>0. For any ¿>a let D, be the H-invariant 
open set Dz={p : Py >0, A< Po? — p1? — po? — pz? < §}. Let 


Pf.E) = > f(pidp = (fe C,(P*)). 


Since det L=1 for Le H, F(f*,é)=F(f,é) for all ¿>a and L eH. More- 
over, if f20, F(f,€) increases with £. Hence, on writing 


If) = a FUE) (E > a), 


we see that f—J,(f) is an H-invariant nonnegative linear functional on 
C(P*). It therefore determines an H-invariant Borel measure on P4, 
On the other hand, the transformation 


S : (Po,P1,Pa,P3) > (Po? — P1? — P? — Ps”, P1 Pas P3) 
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is one-one on D, so that we also have 


_ (Eò f Ketele, p) 
roa fi (G irp e 


(|p|? =p? +p2+p3°). From this we obtain 


_ f £E+1pl?)*”, p) 
N = | Seria 


This formula shows that the measure defined by J m2 is actually supported 
by the set X,,* and is thus the invariant measure on the orbit X,,*. We 
write «,,* for this measure. Thus 


fim? + |p|*)*®, p) 4 


(87) zat et S Jo Dea pe 


Note that as m — 0+, the integral on the right remains finite, since the 
function p —> |p|? is integrable around the origin in the space P’. Hence 
the same formula defines the invariant measure on X,t also. Note that 
X,* is not closed in Pt and yet the invariant measure on X*+ is actually 
a Borel measure on P*. 

A similar computation leads to the following formula for the invariant 
measure a,,~ on Xm: 


— 2 + 2 12 


We may follow an analogous method for determining the invariant 
measures Bm on the orbits Y,,. We choose a and £ such that a< €<0 and 
define D.’ as the open set {p : a <P? — pı? — p2 — p3? < €}. The trans- 
formation S is not one-one on D,’ but one-one separately on D; A 
{p : Po> 0} and D.’ A {p : po <0}. Hence: 


: fiu + |p|?) p) +f (— (u+ |p|?)*”?, p) 
do = a Ne an bdu. 
ft P f (a 2u + |p|?) p) “ 


We may now proceed as in the earlier case and obtain the formula 


_ f((—m? + |p|?)*?, p)+f(—(—m? + |p|?)"/, p) 
(89) h fdn = i 2(— m? + |p|?) dp 
for all feC,(P4). 


The foregoing observations are collected together in the following 
theorem. 


Theorem 9.2. The group H acts transitively on Xm*, Xm (m20), Xoo; 
and Ym (m>0) which, as m varies, exhaust the orbits of H in momentum 
space. The stability subgroup of H* at (+m,0,0,0) ts the unitary subgroup 
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K*, while the stability subgroup at (+1,0,0,+1) ts the subgroup E* (85) of 
H*, Each orbit admits invariant measures. If we use pi, Po, pz as global 
coordinates on X,* and Ym, the invariant measures «,* and Bp are given 


by 


dp,dpdp 
da t ~ nA > 
“m 2(m? + p,? + p27 +p3")'!? (m 2 0), 
dpm dp:dpadps (m > 0, |p| > m). 


~ 2m? + py? + po? +p) 


3. THE REPRESENTATIONS OF THE INHOMOGENEOUS 
LORENTZ GROUP 


The inhomogeneous Lorentz group Q as well as its covering group (* 
are semidirect products. Moreover, from theorem 7.40 we conclude that 
all the multipliers of G* are exact and also that every irreducible 
projective representation of G is induced canonically from a unique 
irreducible representation of G*. The group G* is a semidirect product and 
to it we apply the theory of Chapter VI, especially theorem 6.24. The 
orbits of H* in Pt have been calculated in Section 2 and it is obvious that 
the orbit structure is smooth. Furthermore, the stability subgroups of H* 
associated with these orbits are described in (81) and (84). Theorem 6.24 
therefore enables us to describe the irreducible unitary representations of 
G* completely. In what follows we shall carry out such a description, 
arranged according to the various orbits. 

The Orbit Xo. Since Xo.>={p : p=0}, the stability subgroup is the 
whole group H*. The irreducible representations of G* associated with 


Xoo are obtained by selecting an irreducible unitary representation m of 
H* and defining the representation U°®” of G* by: 


(90) [790.7% : h, x —> n(h) (h e H*). 


The Orbits X,,* (m >0). We select ( + m,0,0,0) as representative points 
of X,,*. The stability subgroup of H* is the unitary subgroup K* whose 
irreducible unitary representations are the representations D/ 
(j=0,4,1,...) described in Chapter VI. Let us choose a strict’ (H*,X,,*)- 
cocycle ø’ whose values are unitary operators in the space W’ of D’ such 
that g’ defines the representation D’ of K* at (+m,0,0,0). Let 


(91) HEI = G(X mnt W amt). 
For ze R* and he H* we define the operator UP +i of "+5 by 


(92) (Uii Sp) = exp i{x, p}p'(h,d(h)~*p) f(8(2)~*p); 
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here {.,.} is defined as in (76) and (h) is the Lorentz matrix corresponding 
to he H*. In view of our discussion in Chapter VI we know that 


(98) U™ +i: h, a> Uma 


is the irreducible representation of G* corresponding to the orbit X,,* 
and the representation D’ of K*. 


The Orbits X,+. The stability group at the points (+1,0,0,+1) is the 
set of all matrices (z, a e C, |z| =1) 


tea = (G a) 
M, a = . 
7» 0 z7? 


We have written H#* for this group. Our first objective is to determine 
the irreducible unitary representations of E*. Now, E* is itself a semi- 
direct product so that this can be done with the help of the theory of 
Chapter VI. We write A ={m,,,}, T” ={m, o}. Clearly 


Mea 
(94) 
Me 9M, gMz,9 = My, za 


= Mi az™Mz,05 


Thus, with respect to the action z,a—2%a of T” in A, H* becomes 
identified with the semidirect product of A and 7”. 
For each complex number b, 


Yo : Mi a — exp i Re(ba*) 
is a character of A, and b —> y, is an isomorphism of the additive complex 


number group onto Â. The adjoint action (cf. (127) of Chapter VI) of T” 
on Á is given by 
Mz,0 + Yo > Yz% 
and the 7”-orbits in A are 
Co = {Yo} 
and 
O, = {m : |b] =p} (p> 0). 

The stability subgroup at yo is T” itself; at any point of C, it is {m 41,0}. 

The representations of E* associated with yọ are simply those of T” 
lifted to Æ*. Thus, for each integer n, 


(95) Tn €: Mz a > 2" (n = 0, +1, +2,---) 


is a unitary irreducible representation of #*. On the other hand, if p>0, 
we note that the stability group associated with C, has two one-dimension- 
al irreducible representations—the trivial one, and the one which sends 
M 1,0 t0 + 1. Corresponding to these, there are two inequivalent irreducible 
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representations m, and m, of H*. Since 7, is associated with the trivial 
representation of the stability group corresponding to C, its description is 
immediate. For describing m,’ we must first determine a strict (7’,C,)- 
cocycle with values in T which defines the homomorphism m,;,9 —> +1 at 
a point of C,. It is obvious that the cocycle 


(96) Mo Y >z (yeC,) 


has the required properties. We thus obtain the following description of 
m, and 7,’. Both of them act in 


(97) X = LT dt), 

and 

(98) (mp(mz,a)f)(t) = explip Re(tz~*a*)] f(z~*t), 
(99) (Tmo (Mza) f)(t) = explip Re(tz~*a*) ef (2-74); 


we have simply taken into account the formulas (94) and applied (132) 
of Chapter VI. 

Having determined the irreducible representations of E* we can now 
obtain the representations of G'* which are associated with the orbits X,+. 
We select for n=0, +1, +2,--- a strict (H*,X,*) cocycle with values in 
T, say ọọ”, which defines the representation: 


Tn > Mz a > 2” 
at ( + 1,0,0, + 1). Then, in the Hilbert space 
(100) Ht” = GF7(X_*,a9*), 
we have the representation U +°" of G* given by: 
(101) (Uiz Sfp) = exp tx, p}p"(h,8(h)~ *p) f(8(h)~*p). 


Furthermore, choose for each p, strict (H*,X *)-cocycles œ? and o°” 
which take values in the unitary group of # = #*(7,dt), and which define, 
at (+1,0,0,+ 1), the representations 7, and 7,’ of #*. The corresponding 
representations U +°, U+?” of G* act in 


(102) HWE = LUX, * Hat) 
and 
(103) (URLIP) = exp i{x, p}p(h,8(h) ~*p) f(5(h) ~*p), 


(104) (Ue? fp) = exp x, p}p?"’(h,8(h) ~*p) f(8(A) ~ *p). 


The Orbits Y,, (m>0). The stability subgroup at (0,m,0,0) is a semi- 
simple Lie group but is not compact. Its unitary representations other than 
the trivial representation are all infinite dimensional. The general theory 
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associates with any one of these, say 7, a representation U'™" of G*. We 
shall not bother to write these down since, as we shall see presently, they 
do not seem to be useful in describing physical particles. 

We shall now take up the question of classifying the representations 
obtained into physical equivalence classes. It is clear that two repre- 
sentations of G* are physically equivalent if and only if-there exists a 
unitary or anti-unitary isomorphism which intertwines them. Now we 
have seen in the course of the proof of lemma 8.3 that, given an arbitrary 
representation v of a group, there is a canonical anti-unitary isomorphism 
of the Hilbert space of the representation 7 onto its dual which intertwines 
m with its contragredient. It follows, therefore, that two representations 
of G* are physically equivalent if and only if they are either equivalent or 
mutually contragredient. The case of contragredience is settled by the 
following lemma. 


Lemma 9.3. Let F x’ B be a semidirect product and let 0 S B be an F-orbit 
under the adjoint action. Let F, be the stability subgroup of F at yeO. 
Then F,-1=F,,, and if O-1 is the orbit of yo}, then for any representation 
a of F, the representations of F x’ B associated with (O,7) and (O-1,7°) are 
mutually contragredient, n? being the representation contragredient to r. 


Proof. Let 7 act in a Hilbert space “7 We select a finite measure « on O, 
quasi invariant under the F action, and define, for he F, r, =da/da”~"), 
Then, the representation U of F x’ B which is associated with (0,7) canbe 
written as 


(Un of My) = {r(A >- y) P ybplh hyf (hy) (yeo), 


where ọ is a strict (',0)-cocycle with values in the unitary group of .%; 
U acts in # = £?(0,%a). Now, F acts as a group of automorphisms of the 
group Ê. From this it follows that F.,,-1,=F,, and that the map y > y~! 
transfers the measure œ on O to a quasi-invariant measure f on O~}. 
Choose any anti-unitary automorphism S of X and define J by 


(Jy) = Sf(y-*)  (ye07t). 


J is clearly an anti-unitary isomorphism of # onto #°=#7(0-1,4B). 
We compute J o U oJ~!=V. A brief calculation shows that, for f° e 4°. 
we have 


(Vr of My) = {rah yo) yS phh -yS Shy). 
It is obvious that z —> r,(27}) is a version of dg/d8*-® and that h, z —> 
So(h,z71)S7} is a strict (F,O-+)-cocycle defining the representation 
Somo NT! at yot. Hence V is the representation associated with O~? 
and Somo N7! Since Soro t~r? and Jo U oJU? (the contra- 
gredient of U), the lemma is proved. 
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Applying this lemma we see that the physical equivalence classes 
corresponding to positive m are {U™ +:1,U™ —-4}. For the representations 
associated with X,* it is clear in the first place that U+” and U->-" 
are physically equivalent. On the other hand, lemma 9.3 applied to £* 
shows that the representations 7, and 7,’ are both self-contragredient, so 
that we conclude that Ut’? and U`’? as well as U+” and U~->*’ are 
physically equivalent. We can now formulate the main result of this 
chapter. It was obtained by Wigner in his 1939 paper. 


Theorem 9.4 (Wigner [1]). Any irreducible representation of G* induces 
a projective representation of G which is irreducible and, conversely, given an 
irreducible projective representation of G there is exactly one irreducible 
representation of G* which induces it. An arbitrary irreducible representa- 
tion of G* is equivalent to one associated with some H*-orbit in Pt and 
some irreducible representation of the stability subgroup of H* at some 
point of this orbit. The irreducible representations U™ ++ (j=0, t,---), 
U+:+n (n=0, 1, 2,---), U+ and U+! (p>0) are physically inequivalent 
and an irreducible representation of G* which is associated with one of the 
orbits X,,* (m20) is physically equivalent to exactly one of these. 


Not all irreducible representations of G* can be used to represent free 
particles. Consider first of all the representations U°°-* associated with the 
trivial orbit Xoo. In every one of these, the space-time translations are 
represented by the identity operator. This implies that the energy and 
momentum observables vanish identically. Consequently, all these 
representations have to be given up, with the exception of the trivial 
one-dimensional representation of G*, in which every Lorentz transforma- 
tion goes over to the identity. It represents the vacuwm. 

We shall next examine an irreducible representation U of G* associated 
with one of the nontrivial orbits X,,* and Y,,. Let Q, be the energy and 
Qı» Q2, Q3 the linear momenta along the three axes in a given coordinate 
system. Since the space-time translation by xeR* goes over into the 
operator of multiplication by the function p — exp t{z,p}, it follows that 
the operators Q, are multiplication operators. A direct calculation shows 
that in the Hilbert space of U, 


(105) Qo? —Q17 —Q2? — Qs” = ÀL, 


where A= +m? if U is associated with X,,* and —m? if it is associated 
with Y„. If we now consider a function (scalar or vector valued) which 
belongs to the Hilbert space of U and vanishes outside a very small 
compact set, it follows that in the corresponding dynamical state, the 
momenta Qı, Q2, Qz and energy Q, have distributions with extremely 
small variance and hence we may approximately regard the particle as 
having sharply defined energy and momentum values. Now for a classical 
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free relativistic particle, the quantity Qo? — Q? —Q2? —Q;? represents the 
square of its rest mass. Consequently, the constant A must also be inter- 
preted as the square of the rest mass. This rules out the alternative 
A= — m? as otherwise it would seem that we have to admit particles with 
imaginary mass. 

This leaves us with the representations associated with X,,* (m>0) 
and the trivial representation. The representations associated with X,,* 
represent free particles of rest mass m. The form of the representations 
suggests that at least for m>0O we interpret U™ +: as representing a 
particle with spin 7. We shall see much later (Section 7) that this is indeed 
the case. In the meantime we shall define the parameter j for the U™ ++? 
and the parameter łn for the U++*" to be the spin of the particle. The 
representation U™-*+>1/2 describes the electron; the representations U™++° 
represent the mesons. The representations U*++*", Ut? and Ut?” 
represent the massless particles. We shall see later that U+-*! represent 
the neutrinos and U +'+2 the photons. 

The representations U +° and U+” correspond to infinite dimensional 
representations of the stability group #*, and hence the states of the 
corresponding particles are represented by infinite dimensional vector 
valued functions on X*+. The physical interpretation of these particles is 
somewhat unclear, as they would have to describe particles whose spin 
observables have infinite spectra (cf. Section 5). 


4. CLIFFORD ALGEBRAS 


Clifford Algebras. The customary representation of the states of a 
relativistic electron is in terms of spinor fields in space-time. Such descrip- 
tions depend heavily on the properties of the Clifford algebra and its 
representations. We devote the present section to this topic. 

Let V be an n-dimensional vector space over the complex field C. We 
shall assume n>4. Let B be a nonsingular symmetric bilinear form on 
V x V. It is known that there exist bases {v,, v,,---, v,} for V such that 


_ f0, rÆs, 
(106) B(v, w) = + r= 3? 


corresponding to such a basis we shall construct an associative algebra 
with unit element 1, which we shall call the concrete Clifford algebra F. 
We do this as follows. For each subset AGN ={1, 2,---, n} we choose an 
abstract element e, with e, =1, and define F to be the vector space over C 
with the e, as a basis; dim(/’) = 2". We shall first define a product operation 
between elements of the set e,. To do this formally, we define, for A, 
BSN, ANB=AU B-AQOQB, for ASN, jeN, let p(A,j) be the 
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number of elements i € A with îi >j; and for A, BEN, (A,B) => jes (A, J). 
Let us then define 
(107) ejeg = (—1)4 Be, yp. 
It is easy to prove that this operation is associative and that le,= 


e,l=e, for all A. To prove the associativity, let A,, Az, AgcN. It is easily 
seen that (A, A43) A 43=4; A (42 A A3). On the other hand, 


p(Ay,4g\As)= > p(Ay,j) = p(A1,4)+p(Ay,Ay) (mod 2), 


JEAQN AZ 


P(A, A4243) = > P(A, Aaj) = p(A;,43)+p(Az,43) (mod 2). 


JEAg 
From these relations we obtain the relation 
€4,(€ 4,05) = (€4,€42)€43 


at once. Since the e, span F, it follows that F becomes an associative 
algebra under the product operation defined by 


(108) È Pala 2 gaea = D (LAPD aeea 


The mapping which sends v, to e; extends to a linear isomorphism of V 
into F. Let us write 7 for this linear map and write e,=7(v,). From (107) 
we then obtain 


(109) e? = l, ee, +e, = 26,,, 
and for A ={i1, %,---,7,} with 1<1,<1,<---<i,<n, 
(110) e4 = Eiig Cip 


Thus F is generated by the e, and the generators satisfy the relations (109). 
From (109) we obtain the following equation: 


(111) t(v)? = B(vw)l (ve V). 


The construction of F which we have given here is not an invariant one 
since we have used the basis {v,,---, v,} explicitly. We shall now indicate a 
more abstract construction. Let Z be the tensor algebra over V and let $ 
be the two-sided ideal in © generated by the set of all elements of the 
form 


(112) a, = v Q v— B(vw)l. 


We observe that 3 is proper. In fact, consider the concrete Clifford algebra 
F associated with a basis {v,,---,v,} satisfying (106). By the basic 
property of the tensor algebra ©, there exists a homomorphism, 7 say, of 
T into F, such that t=7 on V. Since 7[V] generates F, 7? maps Ẹ onto F. 
Since r(v)? = B(v,v)1 in F, #(a,)=0 so that E kernel (7). This proves that 
9 is proper and, indeed, that dim T/9> 2". 
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We shall now demonstrate that 7 has precisely ¥ as its kernel. This will 
be done if we show that dim T/$ <2". Consider the basis {v,,---, Vn}. 
Every element in ZT is a linear combination of elements of the form 
vn, O: Quv, Now, v,@v=1 and y @ v,;= —v; @ vy, modulo 9, and 
hence it is obvious that v, ®@---@ v,,= +w modulo ¥, where w is of the 
form v, @---@ v, with 1<j,<jg<-+-+<j,<n. For a given s, there are 


(*) such tensors and hence dim (&/3) < (5) + (7) +-+- =2". This proves 


that X is the kernel of 7. 

Let r be the canonical homomorphism of © onto T/X. We write @ for 
T/3 and call it the Clifford algebra of V associated with B; when there is 
no doubt as to what B is, we omit the references to B. Since 7 is identical 
with r on V and since 7 is a linear isomorphism on J, it follows that 7 
maps V isomorphically into %. 

The Clifford algebra @ has the following universal property; if 6’ is an 
associative algebra with unit 1’ and 7’ is a linear map of V into @’ such 
that (i) r'[V] generates @’, and (ii) 7’(v)? = B(v,v)1’ for all v e V, then there 
exists a homomorphism 7~ of @ onto €’ and one only, such that the 
diagram 


A 
(113) V r~ 
5N y 


commutes; for there exists a homomorphism 7’ of & into @’ such that 


= 


7 =r on V, and as 7[V] generates @’, 7 is surjective. By (ii) above, 
a, E kernel (7’) so that Yckernel (7’) and hence 7’ induces a unique 
homomorphism r~ of @ onto @’ such that (113) commutes. Furthermore, 
this universal property determines the pair (@,7) up to isomorphism. In 
fact, let 6’ be an associative algebra with unit 1’ and 7’ a linear mapping 
of V into €’ such that (i) m'[v] generates €’ and 7’(v)?= B(v,v)1’ for all 
ve JV, (ii) (@’,7’) has the universal property mentioned above. Then there 
are homomorphisms 6 and 6’, such that @ maps @ onto @’, and 0’ maps 


€ onto @, and the diagram 


E 

i 

(114) y 0) |e 
TN ¢ 


commutes. Diagram (114) shows that 0 o @’ and 6’ o @ are both identities 
and hence that 0 and 0’ are isomorphisms. 
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The invariant construction of @ enables us to associate automorphisms 
of € with B-orthogonal transformations of V. More precisely, let L be an 
invertible endomorphism of V such that 


B( Lv, Lv) = Biv,v) 


for all v. Clearly, there exists an algebra automorphism L~ of T which 
extends L. It is obvious from (112) that a,,=L~(a,) for all ve V from 
which we easily conclude that L~(3)=. This proves that L~ induces an 
automorphism of @. We write this automorphism as x —> zx! (xe @). 
Since L — L~ is a homomorphism, we conclude that the mapping, which 
assigns to L the automorphism x —> x" of @, is a homomorphism of the 
orthogonal group of B into the group of automorphisms of @. Note that 


(115) n( Lv) = m(v)é. 


Simplicity. The central fact in the theory of Clifford algebras is that when 
dim V =n is even, @ is simple. We shall now proceed to prove this result. 
Suppose that 2 is an arbitrary nonzero two-sided ideal in @. Any nonzero 
element u € 2 can be written uniquely as 


(116) u = pol + > Pa,- T (0) ml). 
1sty<ig<---<iysn 

We shall choose u to be a nonzero element of 2 with the smallest number 
of nonzero p,,,... ;, in its-expansion (116). Now, a direct calculation shows 
that when r is odd, m(v;,)- - -2(v;,) and m(v,;) commute if 7 is one of 7,,---, t, 
and anticommute if 7 is none of 1,,---,1,; whereas, when r is even, the 
situation is exactly the reverse. Consequently, when n itself is even and 
there is at least one p,,... „#0 in (116) above, one can find s, (k,,---, ka), 
and some Jj, such that p,,,... x, #0 and m(v,,)---m(v,,) anticommutes with 
a(v,;). AS a(v,)[a7(%,) > m(n) = + m(v;,,)- + +7 (v,,) any way, we obtain 


(117) a(vj)um(v;) = pol + > Pirog) (0) 


(izs-++.tp) Æ (kist ks) 
— Praska T( Ve) ++ T(Uk,), 


where 9j, ,....4,= Pau,- Writing u’ = 3(u + n(v)unr(v,)), we get, on noting 
that w e 2, 


w = pol + >o o Pinang) nO), 
CERR RETI TEZA es) 
where Ph, =$(Pu, -1 Pun, ) Since the number of nonzero 
Pi- 18 strictly less than the number of nonzero p,,....;,, we have a 
contradiction to the minimal nature of u. This shows that all p; ,... must 
vanish in (116). But then p#0 and hence 1 e2, showing that =. 
We have therefore proved that @ is simple when n is even. 
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The Spin Representation. By the classical Wedderburn structure 
theorem (cf., for example, Weyl [2], pp. 79-96, 280-290), there exists, 
up to equivalence, exactly one finite dimensional irreducible representation 
of the simple associative algebra @ when n is even. Let us select a repre- 
sentation of this unique equivalence class and denote it by p. It is known 
that in the left regular representation of @, p occurs exactly as many times 
(as a direct summand) as its dimension. Hence 


(118) dim p = 2" (n even). 


Let us write È for the vector space in which p acts. Let G be the orthog- 
onal group of V and, for any Le Gp, let x —> x" be the automorphism of 
g induced by L. Then x —> p(z*) is also an irreducible representation of ¢ 
in È. Consequently, there exists an invertible linear transformation S, 
of % such that 


(119) p(x") = Sy p(z)S,~* 
for all xe @; note that this is guaranteed as soon as we have 
p(7(Lv)) = Siplr(v))S 


for all v e V. We may even choose S, such that det S, =1. Now, by Schur’s 
lemma, S, is determined by (119) up to a constant multiplying factor. 
Hence, if L, L’eG, and we choose 8,,8,', and S, such that (119) is 
satisfied for L, L’, and LL’ with S,,S,', and So, respectively, we obtain 
the relation 


(120) So = oNN (o E C); 
the determinant condition on Sy, Sı, and S,’ implies that |o|=1. 


Let G be the lese group of all invertible linear transformations of & of 
determinant 1. The set of all (Z,S,) E Gs x Gy, such that 


p(7(Lv))S, = S p(x(v)) 


for all ve V, is obviously a closed subset of G,xG,;, and the map 
(L,S,)—> L is continuous and maps this set onto G,. Hence, by the 
Federer-Morse lemma (cf. Chapter V), there exists a Borel map 
L —> 8,(L) of Gp into Gy such that (119) is satisfied by S,(Z) for all x e 8. 
But then (120) implies that L—8S,(Z) is a projective (nonunitary) 
representation of G, in 2. We have 


S,(DL') = m(L,L')S,(L)8,(L') 


for all L, L’ e G,. The associative law L(L’L")=(LL')L” shows at once 
that m is a multiplier for Gs. Since G, is semisimple, we may use the theory 
of multipliers (Chapter VII) to lift S, to a representation S‘ of the covering 
group G',* of Gs. As G,* does not have any one-dimensional representa- 
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tions other than the trivial one, S’ is uniquely determined by 8. 
S’(¢ —> 8'(@)) is thus the unique representation of G,* in & such that 


(121) p(m(L’v)) = S'(Z)p(m(v))S'(Z)-* 


for all ve V, f —> L’ being the covering homomorphism of G* onto Gp. 

The elements of & will be called spinors and the representation 
S'(?—> S'(?)) and its irreducible components are called the spin represen- 
tations. We treat the elements of È as determining a new type of quantity, 
say q, such that when V undergoes the orthogonal transformation L”, q 
undergoes the transformation S’(/). It was E. Cartan who discovered, in 
infinitesimal form, the spin representations; the global version was con- 
structed by H. Weyl. Cartan also proved that the spin representations of 
G,* cannot be obtained by decomposing the tensor algebra over V for 
n > 4 with respect to the natural action of G',*. The reader who is interested 
in a deeper study of these matters should consult Cartan’s book [1], 
Chevalley’s account [4], and the article of Brauer and Wey] [1]. 

We shall now extend these results to the case of real vector spaces and 
quadratic forms. Let V° be a real vector space of dimension n, n being 
even. Let B° be a nonsingular symmetric bilinear form on V° x V°. We 
write V for the complexification of V° and B for the bilinear form on 
V x V which extends B°. As n>4, the group of invertible endomorphisms 
of V° of determinant 1 which leave B° invariant, is semisimple. We write 
Gpo for the connected component of the identity of this group. Identifying 
each endomorphism of V° with its natural extension to V (as a complex 
endomorphism), we have G,oCG,. Let G,o* be the universal covering 
group of Gpo. Then, denoting by & the space of all spinors, we have a 
projective representation 


Si: b> S,(L) 
of Go in È such that p(a(Lv)) =S,(L)p(m(v))S,(L)~? for all Le Go and 


ve V°. Since Go is semisimple we may argue as before to deduce the 
existence of a unique representation 


l —> (L) 
of G,o* in & such that 
(122) p(7(L/v)) = S°(¢)p(a(v) )S°(Z)-* 


for all ve V° and f € G,o*, f > Lf being the covering map. It is usual to 
call S° and its components also as spin representations. 

It is usual to exhibit S° in a particular way involving the so-called 
Dirac y matrices. We shall select a basis {wo,---, w,—1} for V? such that 


(123) B°(w,,Ws) = ErÔrs 
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where 


+1 0O <rx< 
(124) er 5 t (p <r <n-1). 
If F is the matrix diag{eo, £€1,***, €&n-1}, then Gpo can be identified with the 
connected component of the identity of the group of all matrices L such 
that LFL= F. Since F-1=F, we can argue exactly as we did in the 
Lorentz case to conclude that L'e Gpo, whenever L e Gpo. Since Gpo* is 
the universal covering group of G0, there exists an anti-automorphism 


(125) f—>¢* 


of G,o* such that it induces the anti-automorphism L —> Lf of Gpo 
(cf. (55)). If we write 


(126) S(?) = S(¢*)~* (l e G,0*), 


then it follows from (122) that f —> S(f) is the unique representation of 
G,0* in & such that 


(127) SE) p(ar(v))S(@) = prl (Lv) 
for all v e V° and fe G,0*. Let us now write 
(128) Yr = plr(w,)). 
Then, as p(7(v)*)= B(v,v)1 for all ve V°, we have 

2 — — 
(129) { yre=el (OS 7rsn—l), 

Vist YsYr = 0 (r # 8). 

Conversely, given endomorphisms yo,:--, y,_, of some vector space 2’ 


satisfying (129), there exists obviously a unique representation p’ of the 
Clifford algebra @ in È’ such that p’(z(w,))=y,; in fact, we need only to 
define p’ by this equation on V and extend it to @ using the universal 
property (113). p’ will then be equivalent to a multiple of p. Hence dim 2’ 
will be a multiple of 2. If dim &’=2"?, then p’ will itself be irreducible, 
leading to the irreducibility of the set {yo,---, y,-,}. In this case, the 
equation (127) becomes 


n-i 
(130) SAYS) = È, anla 

s=0 
where a,,(L) is the r-sth matrix entry of L with respect to the basis 
{Wo;° a) Wy -1}- 


We summarize our discussion so far in this section in the form of the 
following theorem. 


Theorem 9.5. Let V be a complex n-space, n>4, and even Let B bea 
nonsingular symmetric bilinear form on Vx V. Then, the Clifford algebra 
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over V associated with B is simple, and has, up to (linear) equivalence, 
exactly one irreducible finite dimensional representation p; dim p=2™2, 
Suppose now that V° is a real linear subspace of V of real dimension n 
such that the restriction B°, of B to V° x V°, is real. Let {wo,-+-, Wn-1ı} be 
a basis for V? such that the relations (123) are satisfied. Then there exists a 
set {yo +s ¥n-1} Of 2"? x 2"? complex matrices such that the relations (129) 
are satisfied. {yo, ¥15°*+> Yn—i} 18 necessarily irreducible and any two sets of 
such y’s are connected by a similarity transformation y, > Ay,A~!. Finally, 
if n> 4, there exists a unique representation S of the universal covering group 
of the connected component of the identity of Gyo such that the relations (130) 
are satisfied. 


The matrices y, are usually known as Dirac’s y matrices. 

Consider now the case when n=4 and V° has a basis {wp ,w,,wo,w3} 
such that B(wo,wo)=1, B(w,,w,)= —1 (r=1, 2,3). We are then dealing 
with a Minkowskian form. We shall now exhibit a particular choice of the 
y’s. Let 0, cz, oz be the matrices ((55) of Chapter VIII) defined by 


0 1 0 -i 1 0 
a= o FN of No = 


and let yo, Y1» Y2, y3 be the 4x 4 matrices defined by 


131 (; ) -( o) (r = 1, 2,3) 
(131) w= o =A, 0 = l, 2, 3). 


It is then a routine matter to check that the relations (129) are satisfied. 
Since the y’s are 4x4 matrices and their irreducible realization is also 
four-dimensional, {y9,71,¥2,v3} must be irreducible. In this case, Gpo is 
the connected homogeneous Lorentz group H, G,* is the group SL(2,C) = 
H*, and the covering map is what we have been denoting by m —> 8(m). 
The relations (130) become: 


3 
(132) S(m)-*y,S(m) = > a,(8(m))y, (r = 0, 1, 2, 3). 
s=0 


yo is a Hermitian matrix; y,, Y2, and y, are skew Hermitian. 

We may use (132) to analyze the representation S of H* in greater 
detail. Let §* (cf. (52)) be the Lie algebra of H*. Then, we have, from 
(132), 


3 


(133) [S(X),y,] = — 2, ds (8(X))y,  (r = 0,1, 2, 3) 


— 
= 


(for 8, see (54)). We shall now determine the matrices (X) in terms of the 
y’s. Since the y’s are irreducible, we can write §(X) as a linear combination 
of the 16 elements 1, y,, Y,Yss Yr¥s¥u YoY1¥2¥3- Lhe commutation rules 
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(129) and (133) imply easily that each S(X) must be a linear combination 
of 1 and the y,y, (r<s). Since the matrices of any representation of a 
semisimple Lie algebra are of trace 0,f and since tr(y,y,) = — tr(y,y,) =0 
for r és, S(X) must be a combination of only the y,y,. Let 


S(X)= > Xy 
O<r<ss3 
Substituting this into (133) we obtain, after a brief calculation, 
C,s(X) = 4a,,(8(X)) 
so that 


(134) S(X)=4 > — ag(5(X))yye. 


Osr<ss3 


If we now use the formula (54), we find, after some calculation, 


135 S(X) = A 0 
from which we obtain the formula 
(136) S(m) K ° 
m) = ; 
0 (m-*)* 


here the asterisk denotes complex adjunction. Formulas (135) and (136) 
display the structure of the representation S explicitly. 

The notion of a spinor field on any manifold on which the inhomogeneous 
Lorentz group acts can now be introduced. Let X be the space on which 
the inhomogeneous group G=H x’ Rt acts. Let & be the space of four 
complex dimensions on which the spin representation S of H* acts. A 
spinor field is then a function % from X into % with the following trans- 
formation property: if we transform X into itself by the element (8(/),2) 
where x is the translation and 6(h) is the homogeneous component, then 
p transforms into the function ~’, where 


(137) ply) = S(A)p((8(h),2)-*-y) (ye X). 


5. REPRESENTATIONS IN VECTOR BUNDLES AND 
WAVE EQUATIONS 


The descriptions obtained in Section 3 for the representations of the 
inhomogeneous Lorentz group are not quite suitable for making explicit 
some of the physical features of free relativistic particles. Also explicit 


t Since any element of the Lie algebra is a sum of elements of the form [X,Y] (cf. 
Chevalley [3], p. 67). 
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calculations become difficult since the formulas of Section 3 involve 
certain cocycles whose existence was only proved abstractly. Our first 
concern in this section is to give more geometric descriptions of these 
representations which will be free of the disadvantages mentioned above. 
In each case we shall construct an appropriate vector bundle on the orbit 
in question, and exhibit the states of the particle as square integrable 
sections of this bundle. These results were first rigorously obtained by 
Bargmann and Wigner [1]. We shall follow their discussion without any 
essential modifications. 

We shall next prove that these sections can be regarded as tempered 
distributions (in the sense of L. Schwartz) in momentum space P4. This 
enables us to take Fourier transforms and obtain the Hilbert spaces as 
spaces of (tempered) distributions on R‘. The differential equations 
satisfied by these distributions will then be the wave equations. We shall 
see at this stage the famous Dirac and Maxwell equations making their 
appearance. 

The distributions in space-time which describe the states of the particles 
are actually functions at least for a dense linear manifold of states. These 
functions are then seen to transform under the Lorentz group exactly like 
spinor fields, thus tying up the theory with the usual principles of 
relativistic physics according to which the states of the free particles are 
to be represented by spin fields in space-time. 


The Representations U™ +? Corresponding to Mass m, Spin j. We shall 
fix m>0 and consider the orbit Xm* with invariant measure «,,+. We 
introduce the complex 4-space C* in which the Dirac matrices yo, Y1, ye, Y3 
act as linear transformations, and treat the elements of C* as spinors. 
The spin representation k —> S(h) (k e H*) is, of course, determined by 
the equations (132). We take for the y’s the matrices defined by the 
relation (131). We write {€9,e,,€2,€3} for the standard basis vectors of C£; 
e, has the components 6,9, Ô; 5;2, 5;3. We write <.,.> for the usual 
Hermitian inner product for C*; <e,,e,> = 4,5. 

We begin by defining the bundle By Y? as 


3 
(138) Bea = (o :PE Xmt ve C$, > DY Y = mo 
k=0 


with the projection 
(139) m : (p,v) > p. 


An easy calculation based on (132) and (55) shows that if (p,v) e Bł'!2, 
then so does (5(h)p,S(h*~*)v). Thus 


(140) h,(p,v) > (p,v)" = (8(h)p,S(h*~*)v) 
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converts B7:1/2 into a H*-space. The action is smooth and projects to 
the usual H*-action on X,,*. For any p € X,,* the fiber at p is the linear 
subspace of Ct given by 


3 
(141) Br?(p) = foa : 2, Peye” = mo}, 
k= 


For the point (m,0,0,0), (141) reduces to the equation 
YoU = V. 


From our formula (131) for yọ we see the fiber at (m,0,0,0) is the two- 
dimensional subspace spanned by e,+e, and e,+e,. Hence (141) also 
defines a two-dimensional subspace. 

We shall construct next a family of covariant inner products on the 
fibers of the bundle. From the formulas (136) for S(h) we obtain 


(142) S(h)*yS(h) = Yo 


after a trivial calculation. Hence the representation S leaves invariant 
the Hermitian form v > m~‘<yov,v> in C+. This form is actually positive 
definite on each fiber of the bundle B12. To see this, we consider a 
(p,v) e B312. We have: 


3 


> PiXy40s0> = mow). 


k=0 


Now, <79v,v> is real as yo is Hermitian, and <y,v,v> is purely imaginary, 
as y, is skew Hermitian. Hence, as the right side of the above equation is 
real, we must have >3_, p,<y;,U,v> =0. We may therefore write 


Poy Vv) = mvv) 
or 
(143) MIY) = Po uwy) (ve B,,**"7(p)). 


Equation (143) exhibits explicitly the positive definite character of the 
form v > m~ yw wy on the fibers of Bx +1/2, 

We now introduce the Hilbert space 2# +:12 of Borel sections of the 
bundle Bł:12. The value of |p|? for a section o(p —> 9(p)) is simply 
the integral 


l, Po~ *<P(P)9(P)>d an * (p). 
Now, using the formula in theorem 9.2 for a,,+, we can rewrite this as 


(144) leit = | <olp).e(p deep), 
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where 
dp,dp.dps 
+1(p) = . 
(145) da (P) = p2 p + ps? F m?) 
The representation U of G* in #°"? is given by 


(146) (Un.zp(p) = exp i{x, p}p(d(h)~*p)’, 


where o(p)” is defined by (140). 

To identify U with U™+12 we use theorem 6.20. We must accordingly 
determine the representation of the stability group K*, induced on the 
fiber at (m,0,0,0). This fiber is the subspace of all vectors of C* of the form 


(") where v € C?; and for h e K*, as h=h*~*, (136) and (140) give us the 


) 

v hv 

This shows that the representation of K* on the fiber at (m,0,0,0) is just 
D2, In view of theorem 6.20, we may conclude that the representation 
defined by (146) is equivalent to U™++1/2, 

We now proceed to the case of higher spin. We consider a half integer j 
(> 4) and write 27=N. For each v=1, 2,---, N we define y,” (r=0, 1, 2, 3) 
to be the operator in the N-fold tensor product Ct @---@ C*#=C*O% 
which sends U, Q: -Q Uy to UW Q- Quy, where u’ =u for iv 
and u,’ =y,u,: in the obvious notation, 


(147) ¥7 =1@::-@y@1®@:::@l (l<v< WN). 


action 


For p € Xm* we define By +4(p) as the set of all (p,t), where t belongs to 
the subspace of C*®” consisting of all symmetric tensors t such that 


3 
(148) > pyt=m (v= 1,2,-+-, N). 
r=0 
We define Bt +? by 
(149) Ba = U Br(p). 
PEXmt 


It is easy to verify that B$ '(p) is the subspace of the symmetric tensors 
which belong to Bt +1/?2(p) @---@ Br+1/2(p), 
For (p,t) e B}! and h e H*, we define 


(150) (pt) = (8(h)p,Sy(h*~")t), 
where 


(151) Sy(h) = S(h) @---@ S(A). 
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Sy(h*"") maps the space By:1?(p) @---@ Byt1/(p) onto the space 
By ?(8(h)p) @---@ By'/2(5(A)p), and leaves the space of symmetric 
tensors invariant. Hence (150) converts Bý! into a H*-space. Further- 
more, S, leaves invariant the Hermitian form 


t—> my O'Q Yot, Oy; 


where <.,.>, denotes the natural Hermitian inner product in C48; 
(143) shows that for all t e Ba’? (p) @---@ Byi+1?(p) one has 


m-"<(¥9 @-++@ volts Ov = Po ttn 


so that this Hermitian form is positive definite on B*+1/?(p) Q- 
Q By’*'?(p), and hence, in particular, on each fiber of B}. The repre- 
sentation of the stability group A* induced on the fiber at (m,0,0,0) being 
the natural one carried by the space of symmetric tensors on the fiber of 
Br 2 at (m,0,0,0), is equivalent to D’ (cf. Chapter VIII). We now define 
H ++) to be the Hilbert space of Borel sections » of Bł! with 


(152) lol? = Í , <olp).op)>xdBe-™P) 
where 

dp dpd 
G59 ARRO = pa ppe AO 


The representation U is defined by 
(154) (Un2P)(p) = exp tx, p}p(3(h)~*p)". 


U ~ Um +i by theorem 6.20. 

The reader might have noticed that we have not treated the spinless 
case in terms of the spinor calculus. This can also be done provided we 
consider, instead of the bundle (149), the bundle where the fibers are made 
up of skew symmetric tensors. More precisely, let 


(155) BY = {(p,t): peX,,*,teC* Q Ct, t skew symmetric, 
(È Pyt = m, v = 1, 2}, 


with the norm defined for any Borel section » of the bundle by 
(156) lol? = |, <rla)olp)>adBt 70), 


The representation of G* is then defined in the same way as (154). Fora 
given pe X,,*, the fiber of B’? at p consists of all skew symmetric 
elements of Bx :1/*(p) @ B,*!/?(p) which form an one-dimensional space. 
As the unitary group K* has no nontrivial one-dimensional representations, 
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it induces the trivial representation on the fiber of B}'° at (m,0,0,0). 
Hence the representation of G* defined in the Hilbert space of sections of 
B}? is equivalent to U™ *>°, 


The Representations U+>+", These representations may be obtained 
by passing to the limit in U™ + as m > 0+. We write 


3 
(157) By* = {pm : PEX t vel, > Pyt = o}. 
r=0 


B,* is a bundle over X,* and the action h,(p,v) —> (p,v)", where 
(158) (pw) = (5(h)p,S(h*"*)v) 


converts it into a H*-space. For any m>0, the fiber of B}? at p™ = 
((1+m?)+/2,0,0,1) is spanned by the vectors 


v™ 


fe, + (1+ (1 +m?) 2)eg, 
H(1+ (1 +m?) "eg + fme, 


which converge, as m —> 0+, to e} and ez, respectively. Now e3 and e, 
span the fiber of B,* at (1,0,0,1). Since H* is transitive on X,,* and X,+, 
the same is true at any other point: if p e€ X*+, there are points p™ e X,,* 
which converge to p as m —> 0+, with the property that any vector v in 
the fiber of B,* at p can be expressed as lim v™, where v™ belongs to the 
fiber of B412 at p™?. However, all this could be done with — m instead of 
m, showing that the bundles Bt;'/? also collapse to Boyt. On the other 
hand, the endomorphism 


(159) I = tyoyi1y2ys 


transforms B*>1/?() into B*;1/2(p) for each pe X,,*+ for any m>0, as is 
easily seen from the fact that I‘ anticommutes with all the y,. In the limit, 
T leaves the fibers of B,* invariant, leading to further degeneracies. 

We calculate quickly that 


1 0 
(160) T = (o 4): 


Hence I’ commutes with all S(h) (cf. (136)). Thus, if (pv) € Bot, (p,Tv) € 
B,*. On the fiber at (1,0,0,1) T has the eigenvalues +1. Hence the same 
is true on all the fibers. We may now define the bundles BE’ +12 by 
(161) Bg? tt? = {(p,v) : (p,v) € Bot, Tv = Fr}. 


H* leaves both Bł’ +12 and Bj’~1/? invariant. A simple calculation shows 
that for the stability group Æ* at (1,0,0,1) the representation induced on 
the fibers at (1,0,0,1) becomes 


(162) M, a > 2h. 
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We have already seen that the Hermitian form 


v —> Po 7 1¢v,v> 


is left invariant by the S(h) and is positive definite on the fibers of B+ »1/2, 
Therefore, letting m —> 0+, we may conclude that the form is still 
invariant. At (1,0,0,1), the fiber of B,+ is spanned by e3 and e, on which 
the form considered is even positive definite. Hence, it is positive definite 
on each fiber. We are thus in a position to apply theorem 6.20. Wt. +18 
is the Hilbert space of Borel sections » of B4’ +*2 with 


(163) lol = | a 
where 
dp,dpodp 
164 dBt> = LL t2 T3 
(164) fo” (P) 2( P1? + Po" + P3?) 
The representation U, which is equivalent to U +'+1, is defined by 
(165) (Un, 29)(p) = exp Ñz, p}p(3(h)~"p)”. 


For higher n, we proceed as in the case of nonzero mass. We write for 
n>l, 


3 
(166) Bet? = fion : pE Xot, te C9", (> pare )é = 0, 
r=0 


Pt = Ft, v = 1,2, n}, 
where 
(167) Iv = Yo Y1 Y2 Y3”. 


The action of H* on these bundles is the obvious one. The norm of a 
section g is defined by 


(168) lott = | <olp),op) hdt p), 
with 

dpsdpaips 
(169) B30) = apa ptp 


We shall end this discussion with a description of the representation 
U +:2 @ Ut+-2 which lends itself more readily to physical interpretation. 
We consider the bundle 


3 
(170) B= {a : p E Xot, vet, {pw} = povo— >, pv, = o}. 
r=1 
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H* acts on B as follows: 


(171) (p,v)" = (8(h)p,8(h)v). 
For v, v’ e Ct we introduce the Lorentz-Hermite form 
Qlv,v’) = —{v,0'*} = — VoVo * +0,01'* HUW * +0303 *. 


We may regard the fiber B(p) of B at p as the complexified tangent space 
to X, at p so that B is simply the complexified tangent bundle, and (171) 
gives the usual action on such bundles. Since 6(/) is real for each h, 
5(h) preserves this Hermitian form also. We claim that v —> —{v,v*} is 
nonnegative on each fiber of B. It is enough to prove this at (1,0,0,1), where 
the fiber is spanned by w,=(1,0,0,1), w= (0,1,0,0), and w= (0,0,1,0). 
The form, expressed in the basis just described, becomes a,w, +a ,w.+ 
AgW3 —> |a2|?+|a3|?. Let #~ be the space of equivalence classes of Borel 
sections p of B for which 


(172) lol? = |, -fo(2).9(p)}dao*(p) < o. 


H~ is complete, but ||-|| as defined in (172) is not a norm, but only a 
seminorm. In order that ||p||=0, it is necessary and sufficient that the 


vector (%o(P),P1(P),P2(P),ps(p)) be proportional to (Po, Pı; P2; P3), i.e., the 
relations 


(173) Pi Px — Prp; = 0 (0<j9<k <3) 


be satisfied for œo *-almost all p. We write # for the Hilbert space obtained 
by going over to the quotient space modulo (173). 
The representation U~ is first defined in #~ by setting 


(Ui.29)(p) = exp o{x,p}p(8(h)~*p)”. 

We now observe that if {p(p),p(p)*}=90 for almost all p, the same is true 
for Uï „p also. Hence U~ induces a representation, say U, in #. To 
compute the equivalence class of U we proceed, in view of theorem 6.20, 
to determine the representation of the stability group H#* at (1,0,0,1), 
induced on the quotient space of the fiber of B at (1,0,0,1), modulo the 
one-dimensional subspace spanned by (1,0,0,1) itself. Since w,, wa, ws span 
the fiber at (1,0,0,1) we find, modulo w,, that the matrix M, a acts as 
follows: 
(174) Ww —> Re(z?)w. — Im(z?)wsy, 

Ws —> Im(2?)w, + Re(z?) ws. 
Equations (174) show that the representation of H* in question is the 
direct sum of the two characters 


+2 
Mza >z . 


This proves that U~U+t?@U*:-?, 
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The Representations U+? and U+:°-’, These cannot be obtained by any 
limiting process from the representations corresponding to particles of 
nonzero rest mass. Since the inducing representations are infinite dimen- 
sional, these are not of much physical interest as the usual interpretation 
forces the spin observables to have infinite spectra. We do not discuss 
these in this book. The reader who is interested may refer to the articles 
of Bargmann-Wigner [1] and Wigner [5]. 


The Infinitesimal Operators. One of the advantages of describing the 
representations of the group G* in Hilbert spaces of sections of various 
vector bundles is that the infinitesimal operators, which are in general 
unbounded, are all applicable to the smooth sections of the bundle. We 
can thus study the infinitesimal representation in a very explicit fashion. 
If X is any element of the Lie algebra of G* and U is a representation of 
G*, we write X~ for the self-adjoint operator such that 


(175) exp(tX~) = Usxptx- 


The calculations involved in computing the various operators X are very 
simple, and we confine ourselves to the determination of the angular 
momenta M,,, 1<r,s<3 (cf. Section 2 for the definition of the elements 
M,;). 
Mass m > 0, Spin j. We obtain from (134) the equation 
N 
8X) = 5 DD aalX) yee 


= k<£ 


for the differential of the representation (cf. (151)) Sy. Choosing X in the 
Lie algebra of H* so that 6(X)=M,.¢ (l<k<?¢<3), we find, from (154), 


(176) Mie p = M insp + Mio: 


where (p is a smooth section with compact support) 


N 
(177) Mins? = (ë > vey) 
v=] 
and 
ð ð 
178 M~o = ( 2 =>) 
(178) kz:0P Pe op, Pr õp; p 


In formulas (176) through (178), 1 < k </ <3. The operators M zo represent 
the orbital angular momenta, and the operators Mpy, represent the spin 
angular momenta. Note that the operators My, (k</) are bounded, 
whereas the operators M ṣo are essentially self-adjoint when restricted to 
the linear manifold of the C® sections with compact support of the 
bundle, as is easily seen by applying van Hove’s lemma (cf. Chapter VIII) 
to the manifold X,,*. 
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The case of the zero mass particles is discussed similarly. The formulas 
(176) through (178) remain valid without any change. 


The Wave Equations. The traditional way of describing the states of a 
free particle is to put down a linear differential equation in space-time 
whose solutions will form the linear space, the rays of which are to 
describe the dynamical states of the particles. The equation will in general 
be invariant under the relativity group so that there will be a natural 
representation of the relativity group in the space of solutions. Such 
equations are known as wave equations. We shall now obtain wave 
equations corresponding to the various particles. 

There are a few technical points to be attended to in the formulation of 
these equations. In the first place, the solutions in general will be Schwartz 
distributions in space-time. In the second place, not every solution will 
represent a state; the correct way to single out the relevant solutions is to 
define a norm on the space of solutions and take only those whose norm is 
finite. These norms, which are of course Hilbertian, are in general quite 
difficult to describe, because the general element is not a function but a 
distribution. The physical literature in general does not pay attention to 
these details. An a priori approach to the theory of such Hilbert spaces of 
distributions on manifolds has been given by Schwartz [3]. 

From our point of view, we overcome these difficulties quite simply 
because the Hilbert spaces are already precisely defined; the states are 
certain square integrable functions (scalar or vector valued) on the orbits. 
We then treat these as measures on P‘ and take their Fourier transforms. 
The transformed space will consist of distributions, and we can obtain 
the form of the equation satisfied by them by a look at the measures on 
momentum space. It might be mentioned that when there is no momentum 
space (for example, when space-time is curved), the above approach 
will fail. The problem of rigorous descriptions of the wave equations in 
such cases must be considered open. 

Given a distribution in P4, one cannot define its Fourier transform unless 
it is tempered. We recall, therefore, the definition of tempered distributions 
(cf. Schwartz [2]). Let V be a real n-dimensional Euclidean space with the 
positive definite product ¢.,.> and Lebesgue measure dv. For any trans- 
lation invariant differential operator D and any complex polynomial q on 
V, the function 


p —> sup |g(v)(De)(v)| 


is a seminorm on C’,*(V) and the collection of these seminorms induces a 
locally convex topology for C’,”(V); its completion may be identified with 
S(V), the space of C” functions on V for which 


sup lq(v)(De)(v)| < œ 
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for all q and D. If {v,,---, v,} is an orthonormal basis for V, the operators 
Dy, «+ vn = (E/021)"1- + -(0/0x,)" linearly span the algebra of all translation 
invariant differential operators, 2,,---,2x, being the linear coordinate 
functions on V associated with the chosen basis. The topology can then 
be also induced by the collection of the seminorms 


sup \(l+a,?+ n. +2,7)*(D,,, Lee vn P)(L15° vty tp)| 
Liste Ly, 


for various k=0, 1, 2,--- and »,,---,v,20. A tempered distribution Æ 
is a complex valued linear functional on C,”(V) which is continuous in the 
topology induced by these collections of seminorms. By extending these 
to S(V) we may also regard a tempered distribution as a continuous 
linear functional on S(V). 

For any pe S (V), the function ¢, where 


G(x) = (2n)-"? | expl—iczwylp(oWdo (we V) 
is also in S(V) and 
p —> Ê 
is an automorphism of S (V). Given any tempered distribution 
Elp > Elp), p E€ S(V)), 
we may then define its Fourier transform Ê by the equation 


(179) E(g) = E(ĝ). 

If L is any linear automorphism of V of determinant +1, L induces, in 
a natural fashion, an automorphism of the space “(V), and (by duality) 
of the space of tempered distributions. We write 
(180) (E*)(p) = Elp’), 
where g~* is the function v — g(Zv). Taking Fourier transforms of 
(180), we get 


(181) (Et)~ = BO, 
where L° is defined by 
(182) < D°v,v') = w, Low (v, v'e V). 
The map 
F:E>E 
is then a linear automorphism of the space of tempered distributions. Let 
{vis ++, On} be an orthonormal basis, P and Q two complex polynomials 


of n real variables, and write 


qxv + ves +2,V,) = Q(z: or) Ln), 
Dp = P(d/da,,-+-, 0/02,). 
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Then, for any tempered distribution Æ, 
(qDpE)~ = Dg. P*E, 
where P* is the polynomial 


P¥ (x10, + “es + 2,V;) = P(12,,- "5 12) 
and 


. 0 . @ 
Do = Wize i aa) 


All this is well known and may be found in Schwartz’s book [2]. 
The possibility of describing the free particles by relativistically 
invariant wave equations in R* depends on the following lemma. 


Lemma 9.6. Let s be a real number >0 and f a complex valued Borel 


function on P* such that 


(183) [i AO poant) <o (m = 0) 
Then the distribution} 
Tye | Apopa) (PECL) 
is well defined, tempered, and its Fourier transform f, satisfies the “wave 
equation ” 
(184) (+m?) Ê, = 0, 


where 


Proof. We shall show that T, is a tempered distribution. Consider first 
the case m>0. Then X„* is a closed submanifold in Pt and the above 
integral obviously exists for all peC,(P*). To prove that T, is a tempered 
distribution, we take a k>4s+2 and rewrite T,(ọp) as 


1 
Tlg) = 5 | [poppe “+p pop)de, 
where 
G(p) = py ??(1 + po? + p17 + P27 +p)", 


PPop) = PPop) +P tP? + po? + p37)", 
and 
Po = (MPP? + po? + py”). 


ft Actually a complex Borel measure. 
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Since both G and the function p — f (Po.p) po “+? are in Y2(dp), we have, 
for a suitable constant C, and all p e C,” (P$), l 


IT (p)| <C sup |(1+<p,p>)*p(p)|. 


This proves at once that T, is a tempered distribution. 
We proceed to the case m=0. We shall first show that 7',(¢) is well 
defined, i.e., that the function 


P —> f(PoP)P(PoP)290~* 
is Lebesgue integrable in the domain p> 0 of P?. But as 
P—>f(Pop)poS*? and = p— glpop)pi 


are both square integrable in P?, the assertion follows. We now argue as 
before to conclude that 


IT (p)| <C sup \(1+<p,p>)*p(p)| 


for suitable constants C and k. This completes the proof that T, is a 
tempered distribution. 

We may therefore form the Fourier transform 7, of T,. Since T s(p) =0 
whenever ọ = 0 on X,,*, we have, for all pe C” (P$), 


T (Po? — P1? — po? — P3? — m?)p) = 0 
or 


(185) (Po — P17 — P? — py”? — m?)T, = 0. 


The distribution (Po? — P1? — P2? — p37 — m?)T,, is tempered and its Fourier 
transform is (— O-m?) f. Hence (185) leads to the equation 


(O+m2)7, = 0. 


Corollary 9.7. If f moreover vanishes outside a compact subset of P* 
and is also bounded, then.T, is a function on R4—in fact, an entire function 
and the equation (184) is satisfied in the usual sense. 


Proof. T, is now a finite complex measure on P* with compact support. 
It is well known that T, is then an entire function. Equation (184) then 
implies the same relation in the usual classical sense. 


We are now in a position to describe the wave equations in Rt. Let us 
first consider the case of mass m and spin }. The Hilbert space of the 
representation U™ +12 is the space of sections f of the bundle B*+1/2 
with the norm defined by (144). Writing 


f = (foSifasfs ), 
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we have 
3 


(186) 2, ¥rD,f(p) = mf (p) 


for «,,*-almost all p; the norm is given by 


Isl? = > f, , P)Ppo ten *( 0). 


From lemma 9.6 we may now conclude that T,, is a tempered distribu- 
tion on P? for all r=0, 1, 2, 3. Let ® be the Fourier transform of T,, 
considered as a (C*-valued) vectorial distribution on R$, 


© = (Pro PrP n). 


Then Ọ is tempered, and we have, as a consequence of (186), 
3 
(187) 2i n D = mo. 


We observe that the equation 
(O+m?)® = 0 


follows from (187). In fact, the square of the operator X? o ty,(0/0x,) is, 
in view of the commutation rules (129), simply the operator — []; applying 
>3_ 9 ty,(0/Axz,) to both sides of (187) we obtain (184). Equation (187) is 
called the Dirac wave equation of the electron. 

The states of a free relativistic particle of rest mass m>0O and spin } 
may thus be regarded as the rays of a certain Hilbert space of tempered 
C*-valued solutions of (187). If we write out the action of G* on © using 
formulas (146) and (181), we see that ® transforms like a spinor field on 
space-time (with respect to a suitable action of the inhomogeneous Lorentz 
group G on Rt). The equation (187) and the fact that the solutions ® 
transform like spinor fields were first discovered by Dirac [2], and they 
constituted the first great step in the mathematical treatment of 
relativistic systems. 

Using formulas (152), (154), and lemma 12.6, we can write the wave 
equations corresponding to arbitrary spin and arbitrary rest mass m. 
In the general case, with m>0, ® is a tempered distribution on Rt whose 
values are tensors of rank N over Ct. The wave equations are 


3 
„ð 
(188) 2 iv 5 D=mb (v=1,2,---,N). 


If we require the distribution ® to take the values in the space of sym- 
metric tensors of rank N over C+, then (188) corresponds to spin 4N; as 
before, (188) implies the wave equation 


(C]+m?7)0 = 0. 
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If N=2 and the values of Ọ are skew symmetric tensors, then (188) 
defines the spinless particle of nonzero mass. The zero mass equations are 
described similarly. The wave equations are 


3 
ra, V 4 — — 
(189) 2 iy, ie 2 = (v = 1,2,-+-,n) 
and 
(190) iyoy y2" ys 2 = -Ò (v = 1, 2,---, 0) 


for the spin $n case; for spin —4n, the equation (190) is replaced by 
(191) Yo YL Y2 Ya" P = @ (v = l, 2,° ey n). 


In all cases, therefore, we have obtained linear differential equations of 
the first order to describe the states of the particles. The early attempts to 
formulate relativistic one-particle theories led to the second-order equation 
((]+m?)®=0 and were therefore regarded with serious misgiving until 
Dirac discovered the equation (187). 

It is sometimes asserted that the concept of spin enters inevitably in 
any relativistic description of a free particle as soon as one attempts to 
formulate Lorentz invariant first-order wave equations to describe the 
states of the particle. This is a misnomer inasmuch as even the spinless 
particles are described by such equations (cf. (155)). There is only one 
mathematically meaningful way to look at spin—namely, as a conse- 
quence of the method by which one differentiates between different 
systems which are covariant with respect to the group of motions of 
configuration space. 

We have been somewhat vague as to which solutions of these wave 
equations are to be singled out for describing the quantum mechanical 
states of the particle. These solutions will have to be tempered, but this 
alone is not sufficient. In fact, consider a tempered vectorial solution ® 
of (187). We may then write ® =‘, where ¥ is a tempered distribution on 
Pt. Since (O+m?)®=0, (p.?—p—-p.2—-p?-—m’)¥ =0, so that ¥ 
vanishes for all elements of C © (Pt) of the form 


(P — Px" — Po” — p} — mM?) p peC,”(P*). 


It can be shown easily that an element of C,”(P*) has this form if and 
only if it vanishes on X,,+ U X,,~. Therefore, we can construct distribu- 
tions ¥+ and Y- living on the submanifolds X,,* and X,,~, respectively, 
such that P=¥+t+'-. Clearly, ¥-=0 is a necessary condition (the 
so-called restriction to positive energy solutions). But the actual condi- 
tions are very complicated; these imply that ¥- =0, ¥+ is a measure on 
X,* absolutely continuous with respect to a,,+, and (d¥ + /da,*)-po~ 1/7 € 
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L?(X nt ,C4,c,,*). It seems difficult to exhibit natural conditions on ® itself 
which are equivalent to these. 

We shall conclude this section with a few remarks on the representation 
U++2 Gq U++-? which was realized in the space of sections of the com- 
plexified tangent bundle to X,* (modulo the subspace of sections whose 
values at each p € X,* are proportional to p). If we write 


(192) Í = (fofifarfs) 
for a section of the complexified tangent bundle of X,+, then (170) leads to 


(193) Pofolp)— 2, Pubelp) = 0 


Let 
D = qT, = (Do, $1, D2, D3), 


where T, is the vectorial measure, 


T, : p> f _ fpdag*. 
Xo 
Then we have, as (po? —p1?—po7—p3”)T; = 0, 
gö = 0. 


Moreover, (193) gives us 


ð ð ð ð 


1 — 0, -—— ©, — — 
(194) Ox, ° Oa, * ôr 


However, different ® will correspond in general to the same state; the 
condition that ® and ®’ may represent the same vector in the Hilbert 
space of states is the Fourier transform of (173), i.e., 


ð ð ' 

Ba, 1 Be, = 0 (r, k = 0, l, 2, 3) (D- P = F) 
or 

(195) curl ¥ = 0. 


The equations []®=0 are the well known Maxwell equations; the 
condition (194) is the so-called Lorentz condition and (195) gives the 
gauge condition. In particle physics, the representation which we are 
discussing now is assumed to represent the photon. A closer analysis 
reveals that the two representations U +'+2 represent the photon in its 
two states of left and right circular polarization. We do not go into these 
ideas here. 
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6. INVARIANCE UNDER THE INVERSIONS 


We have so far restricted our attention to the transformation properties 
of the states of a free particle under the connected inhomogeneous Lorentz 
group. We shall now examine the behavior of the states under the inver- 
sions also. The main result of this section asserts that the irreducible 
representation of the connected group G* associated with a particle of 
mass m>Q0 can be enlarged in an essentially unique fashion to a repre- 
sentation of the complete inhomogeneous Lorentz group G into the group 
of automorphisms of the states. We shall also obtain explicit formulas 
for the operators which correspond to the inversions. We show further 
that the irreducible representations U +*+” corresponding to zero mass and 
spin n do not possess this property of extension. 

Consider now a representation 


a: (Lx) —_> Qn x) 


of the complete inhomogeneous group G into the group of automorphisms 
of the logic of a Hilbert space. We have seen already that the restric- 
tion of « to G is induced by a unique representation U of G*. Let us now 
take an inversion J, (r=s, t, or st) and consider a symmetry S, of the 
Hilbert space # (on which « and U act) which induces aj, 9). Since 


CL te) = %1,,0)%(L,2)%(r,,0)9 


it follows that the representations (h,x) >S,U,y,.S,~1 and (h,x)—> 
Untx induce the same homomorphism into the projective group of # 
(cf. (69) for definition of h,). This means, as G* has no nontrivial one 
dimensional representations, that 


(196) SU n,o = Gann 


for all (h,x) e G*. Conversely, let U(h,x —> Uy,,)) be a representation of 
G* in a Hilbert space # whose logic is denoted by Y, and let a( L,x > a, x) 
be a representation of G in Aut(#) such that « is induced by U. Since the 
group G is the semidirect product of G and the group of inversions H,,,, 
it is obvious that for « to extend to a representation of G into Aut( 2), 
it is necessary and sufficient that there exist elements «,, a, «, of Aut(#) 
such that 


(197) a,” is the identity 
for r=5,t,s#, 
(198) Ost = Ast 


and, such that, if S, is a symmetry which induces «,, (196) is satisfied for 
all (h,x)e G*. The conditions (197) and (198) are the conditions for 
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I,— a, to be a representation of H;,,, and (196) is just the condition 
characteristic of semidirect products. If such «, exist, then the map 


(199) (LI,,2) —> Kr xj He 


is a representation of G which extends « from G. 

We shall now analyze the conditions (196), (197), and (198). In the 
first place, S, can be unitary or anti-unitary and is determined by q,, 9) up 
to a constant multiplying factor. Since J, is a reflection, S,? must be a 
multiple of the identity, i.e., S,2=¢1 for a constant ¢ with |¢|=1. If S, 
is unitary, we may, by replacing S, by £-1/78,, assume that S,2=1. On 
the other hand, for anti-unitary S,, (cS,)(cS,)=S,? for all complex c with 
absolute value 1, so that in this case ¢ is an intrinsic invariant of ez, 9) 
itself. Actually, €= +1, as the following lemma shows. 


Lemma 9.8. Let S be an anti-unitary operator in a Hulbert space # 
such that S?=€1 for some complex constant ¢ of modulus 1. Then C= +1. 


Proof. Let {e,} be an orthonormal basis for # and let e,’=Se,. Then 
{e,'} is an orthonormal basis also and there exists a unitary operator S, 
such that 

Sie, = e; 
for all j. Write S,=SS,-+. Then S, is anti-unitary and S,27=1. From 
S?= {1 we obtain the equation S,S,S,.~1={S,~!. If E —> Pz is the spectral 
measure of S,, based on the unit circle, then the last equation leads to 


S2P.S2 = P CE 
for all Borel subsets Æ of the unit circle. Since S,2=1, we obtain 
Py = Pyg 


for all Æ. This means that (?=1. 
Let U be a representation of G* in a Hilbert space #. Then there exists 
a unique projection valued measure Q on P+ such that 


Uaz) = f exp i{z, p}dQ(p) 


for all x e R+. The support of Q is called the spectrum of U. 


Lemma 9.9. Let a be a representation of G, the complete inhomogeneous 
Lorentz group, into the group Aut L, L being the logic of the Hilbert space 
H. Let U be the representation of G* which induces « on G. Suppose that 
(a) the restriction of U to the translation subgroup is not trivial, (b) there 
exists a constant c such that the spectrum of U is contained in the set 
{p : Poze}. Then ay,o) 1s induced by an anhi-unitary operator for +=t or 
st, and by a unitary operator if r=8. 
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Proof. Consider first time inversion. Let S, be a symmetry of # which 
induces «q, Let M be the spectrum of U. Then M is a closed subset of 
P* invariant under H. Now on any orbit Y,, (m>0), po can take arbitrary 
negative values. So MC{p : po 20}. The condition (196) gives us 


-1 — 
SD aS; _ Ua yxy 


Suppose now that S, is unitary. Let Q be the projection valued measure of 
the representation x > Ua,» Then 


U (1.42) = f exp i{1,7,p}dQ(p) 


= f. exp {2,1 ,p}dQ(p), 


which shows that x—> Uy 1,2, has the associated projection valued 
measure E —> Q; ım Since S, is unitary, these two projection valued mea- 
sures would be equivalent. In particular, their spectra will be the same. 
This implies that the spectrum of Q is invariant under Z,. This implies 
that M={0}. This contradicts the assumption (a) of the lemma and 
shows that (J,,0) must be represented by an anti-unitary operator. The 
argument for (J,,,0) is similar. Since Z, = Isu (Z,;,0) must be represented by 
a unitary operator. 
We are now in a position to prove the main result of this section. 


Theorem 9.10. Let U be a representation of G* in a Hilbert space H 
with the property that for some m20, Qx_,+ is the identity operator of A, 
Q being the projection valued measure of the representation x —> U, x. 
Suppose U induces the representation « of the group G into Aut( L), L being 
the logic of #. Then, in order that « may extend to a representation of the full 
inhomogeneous group G into Aut(L), it is necessary that the representation 
of the stability group at some point of Xm*, which is associated with U, be 
self-contragredient. If U is equivalent to one of U™ +1 (m>0,7=0, $, 1,---), 
Ut? or Ut+™@ Ut--*™ (n=1, 2,---), then U induces a representation a 
of Gin Aut(L), and « has an extension to a representation of Gin Aut( L); 
in the first two cases, the extension is even unique. 


Proof. Suppose that U induces a representation a of G in Aut(#), and 
that «œ has an extension to G as a representation into Aut(£). By lemma 
12.9, (Z0) must be represented by an anti-unitary operator S, when 
7=st. We have: 


SO n, xst -i= U,, -x 


for all (A,x) e G*. Let p? be any point of Xm* and let 7 be a representation 
of the stability group H° at p° such that U is associated with X„t and v. 
Then, a direct calculation shows that the representation h, x —> Un, -z 
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is associated with —X,,* and v. On the other hand, as S is anti-unitary, 
h,x-> U, -zis contragredient to U, and hence, by lemma 9.3, it is associated 
with — X mt and 7°, where 7° is contragredient to 7. Therefore 7 and 7° must 
be equivalent. 

Let now U be equivalent to one of U™ ++) (m>0), Ut®, Utr OUt», 
It is obvious that U maps the elements ( + 1,0) of G* into multiples of the 
identity. Hence U induces a representation « of G into Aut(#). We shall 
first prove that if U~U™ ++ (m>0) or if U~U*>°, « has at most one 
extension to G as a representation into Aut(#). It is enough to prove that 
for each 7, the element o;,9, of Aut(#) is uniquely determined by U. 
Suppose that S{ and S? are two symmetries such that 


aD i — 
SOT, (SP) — On tex 


for all h, x e G* and r=1, 2. Then (S)-1S@ commutes with all VU, . 
and, being a unitary operator,t it must be a multiple of the identity, as U 
is irreducible. Hence the corresponding elements of Aut(#) coincide. 

We now come to the existence. A simple and direct argument can be 
given, but in view of the general usefulness of explicit formulas, we shall 
prove the existence part of the theorem by actually writing down the 
relevant symmetries. For U=U™++>° we take #=$#7(X,,*,a,*+) and 
write the representation U in the form 


(Un,29)(p) = exp r{x, p}p(3(h)~*p). 
We define the element «y,o of the group Aut( X) as the automorphism 
induced by the symmetry S,, where 
(Sip) PoP) = Je(Po—P): 
(200) (Sap) PoP) = Jp( PoP), 
(S;p)(Po.P) = (Po. —P); 
J being complex conjugation. It is easily checked that 
Sa = S,8; 
and that 
NUn, 8t = Un re: 
Note that these formulas also remain valid when m=0. 
For U=U™ +- (7=4,1,---) we use the vector bundle description 


(154). # is now the Hilbert space of square integrable sections of the 
vector bundle Bł’. Let N=2) and let Jy be the natural conjugation in 


t By lemma 12.9, S, is anti-unitary for 7=1, st and unitary for r=s. 
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C@---@C (N factors). For any endomorphism A in C let A™ be the 
endomorphism A ®---@ A in € @---@ ©. We now define S, by 


(Sepp) = (yorr7¥3)°' npl p), 
(201) (Sipp) = (Yiya) J npl — Lip), 
(Spp) = yo plp). 
Here, p E€ X,,*; note that —I,p=I,p€ X,,*. 

We observe that yo, Y1, tye, y3 are all real. Hence (cf. (147)) for v= 
1,--+, N, Yo, Y1”, ta", Ya” are also real. Therefore yo”, y4”, and y3” commute 
with Jy while y,” anticommutes with Jy. It also follows from the same 
observation that y&, (yiya) ™®, and (yoyiy3) commute with Jy. This 
implies at once that 


3 
(> PrYr r) orrs w = (yoyiy3) (> par’), 


r=0 r=0 


3 
(202) (> av’) (yiya) Ty = (yiya) I y (> Pr Yr '), 


(> PrYr Je = (> pi'y), 
r=0 


where 1<v <N, Po =Po; Py = — P, r=1, 2, 3. These equations show that 
the transformations S, map sections of B} into sections of Bt. More- 
over, an easy calculation shows that y{” are unitary in C@---@ €. 
So we can use (152) to conclude that ||S,¢||? = ||p||2. In other words, the 
S, are symmetries of &. Also, 


S,? = 1, 
(203) S2 =(—1)1 (r=, st), 
Sx. = S,S, = S,S,. 

It follows from the equation S(h)yo = yoS(h*-~+) (cf. (136)) that S, 
satisfies (196). We claim now that S, also satisfies (196). This reduces to 
proving that 
(204) (viva) I ySy(A¥-7) = Sulh) yiys) YS y. 

Since Jy commutes with (y,y3), this reduces to proving that 
(yaya) PE(h* 1) = I Shyy) PY y. 
Now, Jy is the natural conjugation and so we have 
JSK (yrya) Ty = (JS( A) yiya) I 
where we write J for J,. A trivial calculation shows that 
(yiya)S(h¥-*) = JS(A)(yry3)F 
and this proves (204). 
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The case Ux U+:N © U*>~* still remains to be examined. The sym- 
metries S, are again defined by (201). They still satisfy (203). This time, 
the S, operate on the space of square integrable sections of the sum 
Bt Ni2 @ Bt -Ni2, A trivial calculation (cf. (167)) shows that I’ anti- 
commutes with both y% and (yoyıya)™®Jy while it commutes with 
(yiya) Jy (v=1,---, N). From this the results concerning U follow quickly. 
Note that both S, and S,, intertwine U *'™ and Ut-~* while S, leaves the 
subspaces corresponding to U+'+N invariant. 


Corollary 9.11. The representations U+*% of G cannot be extended to 
representations of G into the corresponding groups of automorphisms of the 
underlying logics. 


Proof. The representations of the stability subgroup at (1,0,0,1) are 
the characters Mm, a —> z*™ which are not self-contragredient. 


7. LOCALIZATION 


In Chapter VIII we described up to physical equivalence all systems 
which are irreducibly covariant with respect to the group of all motions 
of a Euclidean configuration space. In three dimensions this led to the 
classification of these systems through their spin and to the familiar 
expressions for the position and momentum observables. On the other 
hand, in this chapter we have described the Lorentz invariant free 
particles and it would be a natural question to ask whether we could write 
expressions for the position operators of these particles. To make this 
question precise, we select a particular inertial coordinate system, and 
ask, for x) =0 in this frame, whether a suitable projection valued measure 
E —> P, can be defined on the Borel sets of R?, the projections acting on 
the Hilbert space # of the particle, such that the range of Pp represents 
the experimental statement that the particle at time t=0 belongs to Æ. 
We shall take the same point of view as we did in Chapter VIII and require 
P(E —> Pç) to be such that the system is covariant with respect to the 
group of rigid motions of R’. This condition can be rephrased in a simple 
manner. Let O be an observer and let £o, 21, %, Vg be his time and space 
coordinates. We identify the space of O with =? and obtain the projection 
valued measure P on the o-algebra of Borel subsets of R°’. If (h,a) e G* 
is such that a) =0 and h e K*, then the new observer O’ determined by (h,a) 
differs from O only in space origin and space orientation. Since Uy, a) 
represents the operator which expresses the transformations of states 
associated with the change from O to 0’, the above mentioned condition of 
covariance can be expressed as follows: 


(205) OPE a OE: = P schyte) + a (h E K*, ap = 0). 
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The problem of describing the position operators can then be regarded as 
the problem of determining, for a given representation U of G*, all the 
projection valued measures P such that (205) is satisfied. 

Let us write V for the representation U when restricted to the subgroup 
of all (h,a) such that aọ=0 and he K*. Let us call this subgroup M. The 
pair (V,P) is a system of imprimitivity for M based on R? and hence we 
can use the results of Chapter VIII to conclude that there exists a unitary 
isomorphism W of # onto #?(R®,4,dx) and a representation g —> s(g) 
of the group K* in X such that for (h,a) € M, 


(206) (WV, a WFX) = 3(h)f(8(h)-*(x—a)), 
and 
(207) (WP,W~'f)(x) = xe(X)f (x). 


In other words, a necessary and sufficient condition that there should 
exist at least one projection valued measure P satisfying (205) is that 
the representation V be unitarily equivalent to the representation, say 
Ls, of M described by the right side of (206). If W is a unitary operator 
effecting this equivalence, P may then be defined by (207). 

The representation appearing on the right side of (206) can be described 
in another, equivalent, form through the use of Fourier-Plancherel trans- 
forms. Let us consider the unique unitary automorphism 


(208) F:f—>f 
of #2(R3,%,dx) onto itself such that for all peC,(R°), 


200) ft fixiota)ax= | f f rye- o(y)dy) de 
Then a straightforward calculation shows that 
(210) (F Li, aF fx) = exp[—ica,x>]s(h)f(8(h)-*x), 


where <.,.»> is the usual Euclidean inner product. We may obviously 
change the measure dx to any rotation invariant measure in the same 
measure class without changing the form or the equivalence class of the 
representation defined by (210). We have therefore obtained the following 
lemma: 


Lemma 9.12. Let V be a representation of M in some Hilbert space #. 
Let m be a real number >0. Then, for the existence of a projection valued 
measure P based on R? such that (V,P) is a system of umprimitivity, it is 
necessary and sufficient that there exist a representation s, 


s:h—>s(h) 
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of K* in a Hulbert space X such that V is equivalent to the representation V° 
of M in L(R?, X am) defined by 


(211) (Vi af (y) = exp[—*a,y>]s(h)f(8(h)~*y), 
a, being the measure given by 

a. 

— (m?-+<y,y>)? 


Note that m plays a completely harmless role in this lemma. 

We shall now discuss the problem of recognizing whether a given 
representation of M is equivalent to some V° as defined by (211). To this 
end, we need to consider representations more general than the repre- 
sentations V*. Let X =R? — {0}, and let us regard X as a K*-space with the 
action A, y — 8(h)y. Given a strict (K*,X)-cocycle C with values in the 
unitary group of a Hilbert space .%{ we consider the representation V° of 
M in £7(X,H am) given by 


(212) dom 


(213) (Viaf (y) = exp[—ta,y>]C(h,8(h)~*y) f(8 (2) ~*y). 
The representation V° is obtained if we take 
C(h,y) = 8(h). 


The strict cocycle C determines for each r>0, the strict (A*,S*)-cocycle 
C,, 


(214) Ci(h,u) = C(h,ru), 


where S? is the unit sphere. Note that A* is transitive on St. 

The next lemma determines the equivalence class of the representation 
VC completely and it is our basic tool in the subsequent analysis of 
localizability. We make a few comments on (213). The projection valued 
measure Q(H —> Qg), where Qg; is multiplication by ypg, is determined in a 
direct fashion by the V$ a: 


V$ a = Í exp[ —i<a,y)]dQ(y). 


From this it follows that the unitary equivalence class of Q is determined 
by the unitary equivalence class of V°. In particular, the dimension of 
HA, which equals the multiplicity of Q, is determined by the unitary 
equivalence class of Ve. 

We recall that T’ is the subgroup of K* consisting of all diagonal 


matrices 
0 
mo=(j p) =n 


T” is the stability subgroup of K* at the points (0,0,r) (r >0). 


380 GEOMETRY OF QUANTUM THEORY 


Lemma 9.13. Let C! and C? be two strict (K*,X)-cocycles with values in 
the unitary groups of Hilbert spaces X and H?. Then, the representations 
V and V of M are equivalent if and only if for almost all r >0, the strict 
(K*,S+)-cocycles C,' and C, define equivalent representations of the subgroup 
T” of K*. 


Proof. Let R*+ ={r:r>0} and Y=R* x8}. The map (r,u)—> ru is a 
homeomorphism of Y with X which sends (7,6(h)u) to 6(h)(ru). It is easily 
shown that the measure a„ on X goes over to a multiple of Bxo on Y 
under this map, where ø is the normalized rotation invariant measure on 
St, and dB =r?(m? +r?) "dr. 

Suppose now that V°’ and V°” are equivalent representations of the 
group M. Then dim #'=dim X? so that we may assume that X = 
KH*=X, Let D be a unitary operator which effects the equivalence. D 
commutes with all the multiplication operators A,, 


(A,f)(x) = exp[—1a,x>] f(x). 


From this it follows easily that D commutes with all operators Qx of the 
form 


(Qef)(X) = xe(x)f (x). 
From lemma 6.4 we conclude that D= D~ for some Borel map «> D(2) 
of X into the unitary group of X; i.e., 
(Df)(x) = D(x)f(x) (xex). 
The condition that D intertwines the restrictions of VC? and V°? to K* 
now leads to the equation 
C?(h,x) = D(8(h)x)C1(h,x) D(x) 7} 


for each he K* for almost all x. Going over to Y= Rt xS' and using 
Fubini theorem, we obtain from this the equation 


C,7(hu) = D,(8(h)u)C}(h,u) D, (u)? 
for almost all triples (4,r,u), where 
D(a) = D(ru) (ueS?). 


In other words, for B-almost all r, the strict (K*,S,)-cocycles C,! and C,? 
are cohomologous. From theorem 5.27, as K* is transitive on S,, it now 
follows that for almost all r, the representations of the stability subgroup 
T” of K*, defined by C,' and C,?, are equivalent. 

We now come to the converse. Suppose C! and C? satisfy the condition 
described above. We may assume that “'1=%7=X£, Then the strict 
(K*,S*)-cocycles C,! and C,? are cohomologous for B-almost all r. Let 


KH, = L (St Xo). 
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Let the representations of K* in #7(S',%c)—say A!" and A?-'—be 
defined by 
(Aj ’f)(u) = C (h,ô(h)-*u)f(8() ~tu). 
Let Qt be the projection valued measure based on S+ defined by 
(Qe'f)(u) = xe(u)f(u) (E s 8"). 


Then, for any r for which C,! and C,? are cohomologous, there exists a 
unitary operator D, in %3 commuting with all Q,' and having the property 
(cf. theorem 6.12) 


D, o Abr o D,7?} = A2". 
Since D, commutes with Q!, D, commutes with the operator in 4%, of 


multiplication by the function u — exp ir¢a,u) for all a,r. If we then 
write W17, for the operator in %3 defined by 


(215) (Waf) (u) = exp[—irca,u>](Aff) (x), 
then 
(216) D, o Wia o D, = Wing 


for all (h,a)e M. Wi" and W?” are representations of M in % and their 
dependence on r is Borel; (216) is valid for B-almost all r>0. 

We shall now show that we may choose D, so that r—> D, is a Borel 
function. To this end, we rewrite (216) as follows: 


D, Wisa = Wia Dr- 
Let us now denote by , the unitary group of %3 and consider the set 2 
of all pairs (D,r) € Y, x R* such that 

DWira = Wi:aD 
for all (h,a) e M. Since W+" is a representation, it is a continuous map 
from M into Y, and hence it is clear that the set 2 is a Borel subset of 
U, x R*, Equation (216) shows that the map D, r — r maps 2 onto almost 
all of Rt. It follows now from von Neumann’s cross-section lemma (cf. 


Chapter V) that there exists a Borel set M € R+ of measure zero and a 
Borel map 


D' :r—> D'(r) 
of R*—N into Y, such that (D'(r),r)¢Q for all re R*—N. Let D’~ 
be the unique unitary operator in #7(R*,.%7,8) defined by 
(D'~ F)(r) = Dir) F(r) 
forre Rt—N. 
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The definition of W+” makes it obvious that the operators W} , defined 
in £7(R* ,44,8) by 
(Wi oF )(r) = Wh F(r) 


are unitary, that W is a representation of M, and that D’~ intertwines 
W? and W?. In particular, Wt and W? are equivalent. Now, the Hilbert 
space £7(R*,4,,8) is canonically isomorphic to #7(R+ xS', X; B xa), 
which in turn is canonically isomorphic to #7(X,%a,,) under the unitary 
isomorphism induced by the measure preserving transformation r, u —> ru 
of R* xS? onto X. A little calculation now shows that the representation 
W) goes over to the representation V” in the Hilbert space #?(X,.%«.,,). 
Consequently, V“ and V°’ are equivalent. This completes the proof of the 
lemma. 


Corollary 9.14. In order that there exist a projection-valued measure P 
based on R? such that (Ve, P) is a system of imprimitivity for the group M, 
it is necessary and sufficient that there exist a representation s 


s:h—>s(h) 


of K* acting in X; such that for almost all r>0, the restriction of s to the 
subgroup T” is equivalent to the representation h —> C1(h,(0,0,r)) of T". 


Proof. Let s be a representation of K* and let C° be the strict (K*,X)- 


cocycle defined by 
Cs : h, x —> s(h). 


Then the necessary and sufficient condition for the existence of a suitable 
P is simply that, for a suitable s, C? and C" satisfy the condition described 
in lemma 9.13. The corollary now follows at once if we observe that 6(h) 
fixes (0,0,7) if and only if h =m, o for a suitable ¢ with |¢| =1. 


Corollary 9.15. Let (V,P°) be a system of imprimitivity based on R? 
for the group M and let P? have finite multiplicity. Then, an arbitrary 
projection valued measure P based on R? has the property that (V,P) is also 
a system of imprimitivity for M if and only tf there exists a unitary operator 
A commuting with V such that 


(217) P = Ao P?o A^`!. 


Proof. If P is defined by (217) for a unitary A lying in the commuting 
ring of V, it is clear that (V,P) is a system of imprimitivity for M. Con- 
versely, let (V, P) be a system of imprimitivity based on R? for M. Then 
(V,P) and (V,P?) are equivalent to the systems induced by certain repre- 
sentations s and s° of K*. Since V~V* and V~V*, we have Vix V*. 


f Since M is transitive on R?, P° is homogeneous (lemma 9.10). Hence it has a 
well-defined multiplicity. 
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From lemma 9.13 applied to the strict cocycles h, y->s(h), h, y>6°(h), 
we conclude that the restrictions of s and s° to the diagonal subgroup T” = 
{m;,o} are equivalent. Now a look at the formula (211) shows that the 
multiplicity of P° is the dimension of s°. Hence, s° is finite dimensional. 
Consequently, s is finite dimensional also. Now, it is clear that two 
irreducible representations of K* are equivalent as soon as their restric- 
tions to T’ are equivalent. Using the decomposition into irreducible 
components, one can extend this result easily to finite dimensional repre- 
sentations which are not irreducible. This result implies that s and s° are 
equivalent. In other words, the systems of imprimitivity (V,P) and 
(V,P°) are themselves equivalent. Consequently there exists a unitary 
operator A commuting with V which satisfies (217). This finishes the proof 
of the corollary. 

We are now in a position to prove our basic result on the existence of 
position operators. Let © be a system to which is associated the repre- 
sentation U of the inhomogeneous Lorentz group in the Hilbert space 7 
Then © is said to be localizable at time x)=0 if there exists a projection 
valued measure P (in #) based on R? such that 


-1 _ 
On oP EUR 3G — Paniei+a 


for all (h, a) e M. The following theorem is the main result of this section 
(cf. Wightman [1}). 


Theorem 9.16. Let the representation U associated with a system © be 
defined by the pair (X,,*,m), where m>0, and v is a representation of the 
stability subgroup of L* at the point p™, where we write p™ for (m,0,0,0) 
when m>0 and (1,0,0,1) when m=0. If m>0, © is localizable. If m=0, 
© tis localizable if and only if there exists a representation n’ of K* such that 
a and x’ have equivalent restrictions to the subgroup T" of all diagonal 
matrices of K*. 


Proof. We note that K* is the stability subgroup at p™ for m>0 and 
E* is the stability subgroup at p™ for m=0. Let us now select a strict 
(H*,X,,* )-cocycle C with values in the Hilbert space X in which ~ acts, 
such that C defines the representation z at p™. The representation U acts 
in # = L7(X,*,4,a,,*) and, for (h,a) ¢ M, we have 


(On af)(p) = exp[—iKa,p>]C(h,3(h)~*p) f(8(A)~*p). 


We now use the bijection p —> ((m?+<p,p))"?,p) and reduce this to the 
form (218). Corollary 9.14 is then applicable and leads to the following 
necessary and sufficient condition for localizability: © is localizable if and 
only if there exists a representation 7’ of K* such that, for almost all 
r>0, the representation of the stability subgroup at ((m?+1r?)1/2,0,0,r), 
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defined by C, has a restriction to the diagonal subgroup 7” that is 
equivalent to the restriction of 7’ to T”. 
Consider now the point P, ,=((m?+1r?)!/2,0,0,r). Let h,, be the 2x2 
diagonal matrix with entries à and 1/A, where 
le ee (m > 0), 
p712 (m = 0). 
It is easy to check that ô(hm)P,,m=p™. If we now write H*,, for the 


stability group at p,,,, then T'=H*m O K* and its elements commute 
with Am. From the cocycle identities we now find, for h e T”, 
C(h,p™) = C(hmhhm *,p) 
= C( hm Pr,m)ClCh, Pr, m)l hm~ p™) 
so that 
O(h, Pr,m) = RC(h,p™)R-?, 
where 
R= Chm Pr,m) > 
In other words, the representations of T” defined by C at the points 
((m? +r?)"2,0,0,r) are all mutually equivalent and are equivalent to the 
restriction of the representation 7 to T’. We may now apply the criterion 
for localizability described in the previous paragraph to conclude that © 
is localizable if and only if for some representation 7’ of K*, 7, and 7’ 
have equivalent restrictions to T”. 

For m>0, we may take 7=7’ always and so any system © whose 
associated representation is defined by the orbit X,,+ with m>0O is 
localizable. For m=0 the condition obtained is exactly the one stated in 
the theorem. 

Corollary 9.17 (Newton-Wigner [1]). The particles corresponding to the 
representations U™+*+4 (m>0,7=0, 4,1,---) and U*'? are localizable, 
whereas those corresponding to the representations U*+*" (n>1), are not. 
The system corresponding to the representation U +’! GQ U*>—* is localizable. 
The photon is not localizable. 

Proof. The Jocalizability of the systems corresponding to U™ ++ 
(m>0) and U+? is immediate from theorem 9.16. Since K* has no non- 
trivial one-dimensional representations, the system corresponding to 
U +++" ig not localizable if n> 1. 

We now come to the systems whose corresponding representations are 
reducible. The representation U +" @ U+>~* is associated with (X* 7), 
where 7™ is the representation 


œ o 
uae t z-a) 


of E* (cf. (84)). For n=1, x is obviously the restriction of D2 to T”. 
Hence the system corresponding to Ut:1@U*>~! is localizable. If 
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n > 2, the representations D’ whose restrictions to T” contain the character 
Mr o —> ¢” are precisely those for which 27>n and 2j—n is an integer; the 
dimension of D? will then have to be >2. Hence none of the systems 
corresponding to U +” @ U+:-”, n22, is localizable. For n=2, we obtain 
the result that the photon is not localizable. 


Remark. The representation of K* of the lowest dimension whose 
restriction to T” contains the character m, o —> ¢? is D1, namely the repre- 
sentation k —> 8&(k) (k e K*). But the restriction of Dt to T’ contains the 
characters Mm; o —> ¢*? and the trivial character Mm; ọ — 1. In other words, 
the failure of the photon to be localizable stems, in the last analysis, from 
the fact that, for a given momentum vector, it has only two independent 
spin angular momentum states and that, in particular, it does not have 
any spin angular momentum parallel to its momentum. 

The particles with nonzero rest mass have been shown to be localizable, 
and for these there remains the problem of obtaining the expressions for 
the position operators. Such formulas were written by Newton and 
Wigner [1]. We shall now turn to this question. Instead of the position 
operators which are unbounded it is more convenient to work with the 
unitary representation of the additive group of R? they generate. To be 
more explicit, let U be the representation associated with a localizable 
system and let P be a projection valued measure based on R? such that 
the relations (205) are satisfied. We define, for xeR3, 


Bix) = | expl—ixy)dPty)] = expl—in(x)] 


The B(x) are unitary and B(x->B(x)) is a representation of R3. The 
q(x) are the position operators. The correspondence (218) between B and 
P is one-one. It is easily checked that P satisfies (205) if and only if 


(218) Un,aB(x)Uk,a = exp[t(d(h)x,a>].B(3(h)x) 


for all xeR? and (h,a)e M. 
Let us define W,, , and L(a) as follows: 


Whix = B(x)U ho, 
L(a) = Ui, -a 


Here (h,a)e M. The set of all pairs (h,x) is identified also with M. It 
follows after a brief calculation that 


W k h, x —> Whex 
is a representation of M and that (218) is equivalent to 


(219) Wy xLa) Wz 2 = expliC8(h)a,x)]L(8(A)a). 
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We shall now use (219) to write by inspection the expressions for the 
B(x). Let m>0 and let U be the representation U™ +° acting in 7 = 
L?(X n*am); noting that when a=(0,a), {a,p}= —<a,p> we find 


(Unaf (p) = exp[—*Ka,p>] f(8(h)~*p). 
The map 
7: p—->p 
establishes a homeomorphism of X,,+ with P’. We shall use the notation 
Tą to denote the action in X,,* obtained by lifting the translation in P? 
via 7: 


(220) n(Tq(p)) = p+q. 


Since the elements of K* leave pọ invariant, an easy calculation shows 
that 


8(4)7,5(h)~* = Tana (h e K*). 
From this it follows that the map 


(h,a). P —> T- alô(h)p) 


converts Xm„* into an M-space. In the Hilbert space 2#, L(a) is the 
operator of multiplication by the function p— exp i<a,p> and (219) 
shows that we have a system of imprimitivity for M based on X„t 
considered as an M-space. The measure «,,*+ is, however, not invariant 


under the “translations” 7, and, on noting this, the following expression 
for the B(x) may be immediately written: 


(221) BAA = {ZEY perp), 


where pọ is, as usual, the function p —> (m? + <p,p)!?. The reader can 
verify at once that the B(x) are unitary and that equation (218) is satisfied. 
The numerical factor on the right side comes as usual from the Radon- 
Nikodym derivatives. If we write q, for the operator q(e,), where ex 
is the vector, all of whose components except the kth one vanish and whose 
kth component is unity, we obtain from (221) the equation 


(222) iq. f = dt TA J: 


here Pı, Po, Pz are global coordinates on o and the 2/dp, are the 
corresponding derivatives, while f is a smooth function vanishing outside a 
compact set. 

The most general B satisfying (218) can now be determined by an 
application of corollary 9.15. Let W be any unitary operator commuting 
with the representation U restricted to M. Commutativity with all 
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Ug (49 =90) leads at once to the fact that W is multiplication by a Borel 
function, say u, such that |u(p)|=1 for all p. The commutativity with all 
Uno (h e K*) now forces u to be K*-invariant and hence to be a function 
of pọ. Hence the most general B such that x — B(x) is a unitary repre- 
sentation of R? satisfying (218) is of the form 


“fltgp), 


BSNP) = explillpa)(p)—b(po)(r4p))}{ 22L 


where k is an arbitrary real valued Borel function on the positive reals. 

We shall now proceed to the case of nonzero spin. Let us take U = U™ + J 
in the form (154), acting in the Hilbert space # of sections of the bundle 
B}. Let us assume first that j=4. It is natural to assume for B(x) an 
expression of the form 


(Bex) fp) = {2P | SR p rp), 


where R(x,p) € H* and S(h —> S(h)) is the usual spin representation of H*. 
In order that S(R(x,p)) may transform the fiber at 7,p to the fiber at p, 
it is enough to assume that 5(R(x,p)*) maps p onto 7,p. In order that B 
may be a representation satisfying (218), we should require that the map 
x, p —> R(x,p) satisfy the conditions: 

(i) AR(x,p)h~1 = R(d(h)x,d(h)p) (h e K*), 

(ii) R(x+x',p) = R(x, p) B(x’, Tx p), 

R(0,p) = 1, 

(iii) 8(R(x,p)*)p = txp. 
An easy calculation using (152) shows that each B(x) will be unitary. 

We shall now “‘solve”’ equations (223) for the R(x,p). R? acts freely and 
transitively on X,,* by the action x, p —> 7, :p, and (ii) of (223) just asserts 
that x, p > R(x,p)-} is a strict (R*,X,,‘)-cocycle. Therefore, by lemma 
5.23 we may write 


(223) 


R(x,p) = L(p)-*L(rxp), 
where p —> Lp) is an arbitrary function. The condition (i) then becomes, 
on simplification, 


L(8(h)p)hL(p)-* = L(8(h)ryp)hL(7,p)-* 


This shows that the function p —> L(8(h)p)hL(p)~?} is 7,-invariant for all 
x, and therefore must be a constant, possibly depending on h: 


L(8(h)p)hL(p)-1 =C, (he K*). 


Equation (iii) of (223) can be analyzed similarly. We have, on substituting 
the expression for R in (iii), 


8(L(p)*-*)p = 8(L(rxp)*~*)(rx2). 
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This shows that p — 6(L(p)*~*')p must be a constant also. If we use the 
map 


p—> Alp) = Poto + P10: + P202 + P303 


which sends elements of P4 into 2x 2 Hermitian matrices, and write the 
condition that p — 6(L(p*)~*)p has the same value for all p, we get 


L(p)*~*A(p)L(p)-* = mL(p™)*-*L(p™)~? 
where p™ = (m,0,0,0). (iii) of (223) therefore can be replaced by 
A(p) = m{L(p™)~*L(p)}*{L(p™)~*L(p)}. 
It is now easy to determine a solution L. We define L(p™)=1 and 
(224) L(p) = m~*?A(p)??, 


Note that for p € X,,+ (m>0), A(p) is positive definite so that L(p) is 
well defined and det L(p)=1. We then obtain, for R, the following 
expression: 


(225) R(x,p) = A(p) Alp)”. 


We note that (i) of (223) is automatically satisfied (C, =h). Collecting all 
this, we obtain 


22) (BUx)f)(p) = { PAPEL" SIAPA (reP) 


Actually we now use (226) to define B. It is not difficult to check that the 
B(x) are unitary and that x > B(x) is a unitary representation of R3 
satisfying (218). For higher spin, we replace (226) by 


(227) (B) f)(p) = { Pole) Py Su(A(p) -YAra p)? S (rap), 


where Sy=S Q-Q S (N=2) factors). 
We can now write the infinitesimal form of (226). To do this it is 

necessary, among other things, to compute 
Rep) = {5 S(4(p)-29.4( 90)! 


Obviously, if we write 


(xp) = ApS Ari) 


t=0 


=0 


then 9t(x,p) is a matrix of zero trace and 


R(x,p) = S(N(x,p)), 
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where S is the differential of the representation S. From (i) of (223), we 
have: 


(228) S(A)R(x,p)S(h)~* = R(8(h)x,d(h)p) (he K*), 


so that R is uniquely determined by its values for all x and all p of the 
form p = (Po,0,0, p3). For fixed x and such p, we find, on writing 


d 
“= ii {A(T P) h0 
and differentiating the equation [A(r7,,p)!/?]?=A(7xp), that 


%3(Po+Ps)/Do Lı — Iq 
Alp) ?u+uA(p)"? - ( , ) 
Tı + Xe — %3( Po —P3)/Po 


Using the fact that u is Hermitian we can compute it from this equation. 
Then N(x, p) reduces, after some simplification, to 


M(x,p) = ( %3[2Po (£; —i£2)/Po + P3 | . 


£i +i£2)/Po— Pg +m — x3/2Po 


Expressing N(x, p) as a real linear combination of the c, and the io,, and 
using (54) and (134), we find, after some calculation, that 


l < T3 P3 
RX p) = zz > TYY: — Fram (pom) 23 


r=1 


l 
— 3mp, +m) {£2 P3Y2V3 — X1 P3Y3V17 
This formula is valid for p= (po,0,0, p3). A look at the requirement (228) 
of covariance now enables us to write the expression for R(x,p) in the 
general case: 


— <x, 3 l 3 
R(x, p) = <x? > PYY FZ > LsYoYs 
1 j=1 


2MPo( Po +m) £ 
(229) 

1 
-onmin Lam > (Xj Pr — Er PIY 


2M(PotM) 1 <<hes 
The fact that R(x,p) as defined above satisfies (228) for all unitary h is 
straightforward. It is also trivial that, for p, =p, =0 (229) reduces to the 
earlier expression. We omit the verifications of these. We finally obtain, 
from (229), the following expressions for the position operators (x): 


(230) if = = Dm =S SER J-R 
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for any smooth section f vanishing outside a compact set. The generaliza- 
tion of (230) to arbitrary j is trivial. If we define R°(x,p) by (229) with 
y,” replacing y, (cf. (147), for the meaning of y,’), and write 


Rep) = > Wp), 


then (230) once again gives the expressions for the iq(x) when 7=4N is 
arbitrary. 

We shall now take up the question of uniqueness. We consider first 
j=}. In view of corollary 9.15, we may replace the unitary representation 
B by W o Bo W—}, where W is a unitary operator which commutes with 
all Un a (h,a) ¢ M, without destroying the imprimitivity equations (205). 
Now, for any such W, the fact that the U,, (ag=0) commute with it 
shows that there exists a Borel map 


w:p—>w(p) 


of X,,* into the unitary group of C* such that, for each p, w( p) leaves the 
fiber of Bf +1? at p invariant, and (W¢)(p)=w(p)¢(p) for almost all p 
and all p of the Hilbert space. The commutativity with all U, o (k e K*) 
now leads to the equations 


w(d(h)p)S(h) = S(h)w(p) 


for each h for almost all p. For p=(o,0,0,p3) with p,>0, the S(h) for 
those A with 6(h)p=p are diagonal. It is then not too difficult to determine 
the w which satisfy these equations. We choose a Borel function v, r —> v(r), 
of the positive reals into the group of 2x2 diagonal matrices of A* 
arbitrarily and define w by 


vr) 0 


w(p,) = ( 0 v(r) 


(p, = ((m?+r7)*?,0,0,r), v(0) = 1), 


w(3(h)p,) = S(h)w(p,)S(h)~* (he K*). 


It is clear that w is well defined and meets the requirements. For a given w, 
the unitary operator W corresponding to it is given by 


(Wo)(p) = w(p)p(p) (pe X,,*). 


The unitaries thus obtained exhaust the commuting ring in question. 
The case of higher spin is handled similarly. 

It is noteworthy that the position observables cannot be uniquely 
determined. Formulas (222) and (230), however, seem to give the simplest 
of the possible choices. For the choices represented by (222) and (230), 
we can calculate the velocity observables by differentiating the position 
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observables with respect to time. Since (cf. Chapter XI) these are their 
commutators with energy, we find, using (230) and writing q¢,=q(e,), 


(231) Vk = Ík = PrlPo- 
The velocities are conserved in time. Formulas (231) are the same as those 
in classical relativistic mechanics. 


8. GALILEAN RELATIVITY 


Our concern so far has been exclusively with the relativity prescribed 
by the Lorentz group. Clearly the whole circle of problems can be analyzed 
under the over-all assumption that the space-time group is the inhomo- 
geneous group of Galilean transformations. We shall indicate briefly in 
this section the modifications that are to be made in the foregoing analysis 
when we change the underlying group. To describe the free particles 
under the assumption of Galilean relativity, we must describe and 
classify up to physical equivalence the irreducible (possibly projective) 
representations of the inhomogeneous Galilean group. We carry out our 
discussion along the same lines as we did for the Lorentz case. 

Let us now consider the universal covering group of the inhomogeneous 
Galilean group (cf. (112) through (123) of Chapter VII). This consists of 
the quadruples 

(h,,v,u) (he K*, neR!, v, ue R3), 


with the multiplication law 
(h,n,v,u)(h’,n’,v’,u’) = (hh’, ntn, (h) +y, S(hju' +u+7’v), 


where h, k'e K*, 1, 9’¢R', u, veR?. The space-time translations (1,7,0,u) 
form an abelian normal subgroup; the group of the homogeneous Galilean 
transformations (h,0,v,0) acts on the translations (via inner automorphisms) 
as follows: 

(h,0,v,0) : (1,7,0,u) — (1,7,0,5(h)u+ nv). 
This gives, of course, the semidirect product structure of the inhomo- 
geneous Galilean group. We introduce “‘momentum space” Pt of points 


(Pop) as the dual of the vector space of space-time translations (1,7,0,u). 
Each (p,p) gives rise to the character 


(1,y,0,u) > exp[?( pon +<p,u>)], 
where <. ,.) is the Euclidean inner product in R3. The adjoint action of 


(h,0,v,0) in Pt can now be computed; an easy calculation shows this 
to be: 


(232) (4,0,v,0) : (po,P) > (Po—<v,8()p>, 8(h)p). 
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From (232) one can easily obtain the orbits for the homogeneous group in 
P+. We claim that the orbits are 


(233) Zn, = {(P0.9)}  (—00 < Po < œ), 
and 
(234) X, = {(Po p): PP =r} (r > 0). 


That Z,,, which consists of a single point, is an orbit is trivial. On the other 
hand, (232) shows that X, is invariant. We shall show that any point of 
X, is obtained from (0,(0,0,r)) by applying a suitable element (h,0,v,0). 
In fact, (232) shows that any point (0,p) with (p,p) =r? can be so obtained; 
to pass from (0,p) to (Pop) we choose v e R? so that <v,p = — P, and use 
(1,0,v,0). Note that X, “splits” into R! xS”, where S” is the sphere of 
radius r in R?. An easy calculation based on Fubini theorem shows that on 
X,, the measure dp,do” is invariant, dp, being Lebesgue measure on RI 
and o’ the unique normalized rotation invariant measure on S’. The 
stability group at (0,0) is the entire homogeneous group. The stability 
group at (0,(0,0,7)) consists of all elements (h,0,v,0), where 


0 
h= mo = (i a) (= 1) 


and 

V = (V4,V2,0) (vi, Vg arbitrary). 
The product of (m, 9,0,v,0) and (My ,0,v’,0) is (Mme 5,0,v",0), where 

v” = (mz o)V' +v. 
On writing 
z(v) = V — Wa, 
we obtain for z(v”): 
2(v") = C72(v’)+2(v). 

In other words, the stability subgroup at (0,(0,0,r)) is isomorphic to the 
semidirect product of C and 7 under the mapping (m; ,0,v,0) —> (2,z(v)), 
wherein the product in T x C is determined according to 


(235) (f,2)(C',2') = (00",072' +2). 

The orbit structure of P* is obviously smooth. For the stability subgroup 
at (0,(0,0,7)) we have already described the representations. These are 
the Ta, 

Tn : (Mz 9,0,v,0) —> f" (n = 0, +1, +2,---) 


and the r, and 7,’ (p>0); the latter act in ¥#(7') and are described by 
(cf. (98) and (99)) 


(17,(™mz,9,0,V,0)f)(t) = exp[ip Re(t2~*2(v)*)]f(o~ *t), 
(a7y'(mz,0,0,V,0)f)(t) = E exp[tp Re(tg-*z(v)*)] f (7t). 
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The representations L” associated with any one of these irreducibles, say 
a, may now be written explicitly. We only note that 


(236) (Li,n o, wf) (Pop) = expl[t(nPo+<u,p>)]f (PoP), 


f belonging to a Hilbert space of scalar or vector valued functions on X,. 
The representations associated with Z,, are 


(237) (h,y,V,U) —> exp(ipon)r(h,0,v,0), 


where r is an arbitrary irreducible representation of the homogeneous 
group. 

It is remarkable that none of these representations can represent any 
physical particle. We shall see now why this is so. Formula (237) shows 
that the time translations act as scalars and hence leave every ray fixed— 
there is no temporal evolution. Hence it becomes necessary to give up 
these representations. We come next to the representations L” associated 
with the orbits X, (r>0). Formula (236) shows that the momentum 
operators B,, B,, Bs corresponding to the translations along the axes in 
space are the operators of multiplication by p,, Pa, and p3. But p,? + 
P2 +p3? =r? and hence 


(238) B? + B2+ B32 = r71, 


which gives an unphysical relation between the three momenta. More- 
over, it can be quickly seen that the systems corresponding to these 
representations are not localizable either. To see this, let us write V for the 
restriction of any one of these representations L” to the subgroup of 
translations in space only and let us assume that there exists a projection 
valued measure P(E — Pz) based on the o-algebra of Borel sets of R? 
acting in the Hilbert space # of V such that 


VuPeVy 7! = Pru 


for all uc R? and E c R3. From theorem 6.17, we know that there exists a 
unitary isomorphism W of # with 2?(R°,%,dx), where X is a Hilbert 
space, such that 


(WV Wo") f)(x) = f(x—u) 


for all fe #?(R?,%,dx). Now, u -> V, is a representation of the abelian 
group R? and hence there exists a unique projection valued measure Q 
on the space of vectors p such that 


Vy = Í explicu,p)]dQ(p). 
p? 
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From the uniqueness of Q and from formula (236) we see that Q vanishes 
for the complement of the set {p : (p,p) =r}; in fact, (236) shows that in the 
Hilbert space 


KH! = LUX,,H do x dpo), 


X being the space of 7, V, acts as multiplication by the function p —> 
exp i<u,p» and hence Q; must be the operator of multiplication by yg, 
proving that Q vanishes for the complement of the set <p,p)=r?. There- 
fore, the projection valued measure WQW~! also vanishes for the 
complement of the set {p : <p,p)=7r7}. On the other hand, we use the 
Fourier-Plancherel isomorphism (cf. (209)) 

F:f—>f 
of 22(R°,47dx) with £2(P°,% dp) to find 

(FWVW-*F-*f)(p) = expl —i<u,p)] F(p). 

This implies that the measure class of the projection valued measure 
WQW -t is that of Lebesgue measure, which is a contradiction, as we have 
already shown that this measure class is supported by the set 
{p : <p,p>=r7} of zero Lebesgue measure. Therefore, the systems 
corresponding to the representations L" are not localizable. 

Our analysis therefore shows that we must consider the projective 
representations of the inhomogeneous Galilean group in order to be able 
to build a theory of Galilean invariant particles. We shall now indicate 
the details of this theory. We shall use the results and the theory of 
multipliers as developed in Chapter VII. According to the results we have 
obtained there, any nonexact multiplier of the covering group of the 
Galilean group is similar to the multiplier m, which is given by 


(239) m,: 7,7’ > exp? 5 [<u,d(h)v’> — <v,d(A)u’> +7’<v,8(h)v’>], 


where 


1 


r = (hm,v,u), r = (h'm, u), 
and 
r#90 
is a real number. Given m,, we can construct the associated extension of 


the inhomogeneous group in the usual way. The elements of this extension 
are pairs (r; ¢) with |¢|=1, and the group operation is given by 


(r; (r; O°) = (rr'; bC'm,(r,r’)). 
We denote this group by G". Now m,(r,r’)=1 when v=v’=0 and so 
(1,7,0,u; 6)(1,9',0,u'; 0’) = (1, +7’,0,u’+u; £2’), 
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so that the set of all elements of the form (1,7,0,u; ¢) forms an abelian 
group. We denote this group by A‘ and, when convenient, its typical 
element by (n,u; ¢). Further, m,(r,r')=1 when u=u’=0 and 7’=0 so 
that 


(h,0,v,0; 1)(h’,0,v’,0; 1) = (hh’,0,8(h)v’ +v,0; 1), 


showing that the set of all elements (h,0,v,0; 1) forms a subgroup of G. 
We write H" for this subgroup and, when convenient, denote its typical 
element by (h,v; 1). We have: 


(240) (ru; texp|—i 5 <uv>| Jon 1) = (h,n,v,u; €). 


G" is thus a semidirect product of A‘ and H". The action of H* on A‘ via 
inner automorphisms is given by 


(241) (h,v; 1) : (y,u; ¢) > (7,6(h)ut nv,¢’), 
where 
(242) t’ = Cexp —i 5 [2v,8(h)u> +9¢v,¥)]. 


We are interested in describing those irreducible representations of G" 
which map the elements (1,0,0,0; ¢) into €-+ times the identity, for these 
are precisely the representations which give rise to the m,-representations 
of the inhomogeneous Galilean group (theorem 7.16). Fortunately, G7 
is itself a semidirect product of A* and H*, and so we can determine its 
irreducible representations using the theory of Section 8, Chapter VI. 


“N 
To this end, we must determine the dual group A‘ of A‘ and classify the 


“™ YN 
H" orbits in A*. A typical element of A* will be written as (»,p; n) where 
Po E Rt, pe R?, and n is an integer; we shall associate with it the character 


(n.u; 2) —> ¢” exp[t(npo + <u,p>)]. 
A brief calculation based on (241) shows that the adjoint action of the 
element (h,v; 1) in a is given by 
(243) (h.v; 1) : (Do.p,7) —> (Pop'n), 
where 


Po = Po—<v,9(h)p> — 4nz<v,v), 


(244) 
p = o(h)p+nrv. 


The reader may observe the difference between this action and the action 
(232). 
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It is quite simple to determine the orbits and stability groups associated 
with the action (244). A brief calculation shows that 


(245) CPP) +2nTPo = LPP + 2nrpo’. 


In other words, the function (o,p; n) —> <p,p>+2n7p, is invariant under 
H' for each fixed n. Let us now define the sets Z, , by 


(246) Zio = {(Po p; n) : CPP) +2nTPo = p}; 


here n is an integer, p a real number, n and p are arbitrary except when 
n=0, for which case p>0. The Z, „ are invariant under H* and 


“N 
A = U Zano 
n, p 


When n=0, (244) reduces to (232) and we find that the sets Zo, (p>0) 
are orbits. The set Zo is not an orbit; Zo o={(0,0; 0) : poeR?}, and 
each point of it is invariant under H*. For n¥0, and arbitrary p, Z,,, is 
an orbit. In fact, the point 


(247) ano = ((207)~1p,0; n) 


lies in Z,,, and one computes quickly from (243) that the element 
(1,(m7)~4p;1) of H* sends a, to the point ((2n7)~}(p—<p,p>),p; n) 
which is, of course, the general point of Z, ,. The stability subgroup of 
H' at a,_, is the group of all elements of the form 


(A,0; 1) 
which is isomorphic to A* (the unitary group). The map 
E : p —> ((2n7)~*(p—<p,p>),p; n) 


is a homeomorphism of Z, , (n#0) with P? and allows us to treat p, pz, 
and p as coordinates on Z, ,. The element (h,0; 1) sends (Po.p; n) to 
(10,6(2)p; n) while the element (1,v; 1) sends (Po.p; n) to the element 
(Yo—<V.p> — 4n7zK<v,V>, p+nrv; n). From this it follows easily through an 
application of Fubini theorem that the measure dp is invariant under H". 


Finally it is obvious that the entire orbit structure of T is smooth. 
Unless n= — 1, the orbit Z, „, will not lead to an m,-representation of G. 
If Z is an orbit and U an irreducible representation of G* associated with 
Z, then U acts on a Hilbert space # of functions defined on Z with values 
in some Hilbert space X. From the general theory of Chapter VI, we know 
that for any ae A‘, U, will be the operator of multiplication by the 
function 2 — Z(a) (2 € Z). In particular, if ZS Z, ,, the element (0,0; 2) of 
A* will be mapped, in any irreducible representation associated with Z, 
into the operator ¢"I (I being the identity operator). Consequently, the 
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m,-representations of G are precisely those that are associated with the 
orbits Z_, ,, p arbitrary. 
We shall now describe these representations. Exactly as in the case of 


the Euclidean group (cf. Chapter VIII), we find that, if 
s:h—>s(h) 


is an irreducible representation of the unitary group K* in a (finite 
dimensional) Hilbert space %, the map (h,v; 1), £ —> s(h) (2€ Z_,,,) is a 
strict cocycle which defines the representation (h,0; 1)—>s(h) of the 
stability group at a_, ,. If we denote by # the Hilbert space 


(248) KH = LP, X dp), 

then the representation U"*” of G", associated with Z_,,, and s, is given 
by 

(249) (URE Sp) = Q's(h) f(8(h)- "(p+ rv), 


where r=(h,7,v,u) and 
(250) Q! = (Letomi2t) -1 exp [u P+5 v) +5 Pp]; 


for this we have only to use the map identifying Z_, p With P®, the 
equation (240) which displays explicitly the semidirect product structure 
of G*, the equation (244) which gives the adjoint action of H', and sub- 
stitute in the general formula (132) of Chapter VI. 

The map r —> U$: S’ will then be the irreducible m, representation of the 
inhomogeneous Galilean group associated with U**. We now observe 
that the number p enters (250) only in the factor e~'°"/?* and hence may be 
dropped when we describe the corresponding projective representation of 
the inhomogeneous Galilean group. We shall write V™ for this irreducible, 
projective representation. It acts on # and 


(251) (VESP) = Q8(h) f(8(2)~ "(p+ rv)) 
with 
(252) Q = exp il (u p+5 v) +5 pp]. 


In this form, the multiplier for V** will be similar to m,. 

There remains the question of physical equivalence among these 
representations. We shall show first that V% and V -=s are anti-unitarily | 
equivalent. Consider any anti-unitary isomorphism J of the Hilbert space 
X onto itself and define © by 


(253) (Of)(p) = Jf(- 
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©. is an anti-unitary isomorphism of #*(P?,%jdp) onto itself. A quick 
calculation shows that 


®©Vtsm-i = VLS, 


where s’ is the representation A —> Js(h)J -1 of K*. Since dim(s’) =dim(s), 
s' œs, s0 that V~"* ~ V-*S, Hence Vand V-*’ are physically equivalent. 
Suppose conversely that V'rsı and V'zs2 are physically equivalent. Let us 
first assume that the equivalence is effected by a unitary operator. 
Restriction to the subgroup of Euclidean motions (7=0,v=0) implies 
that the corresponding representations are unitarily equivalent and 
enables us to conclude, using lemma 9.13, that s; ~s. We may (and do), 
therefore, assume that sı =s,=s and that V'i: and V'z both act in # = 
L?(P3, X dp). It is then immediately obvious that the multipliers of the 
two representations V‘: and V*‘2°s are similar. But we have observed that 
the multiplier of V%* is similar to m,. So our observation implies that m,, 
and m,, are similar. The determination of the multiplier group of the 
Galilean group carried out in Chapter VII now enables us to conclude that 
7, and 7, must be equal. Thus V1": and V*z's2 cannot be unitarily physically 
equivalent unless 7; = T3 and s; ~8p. If, on the other hand, they are anti- 
unitarily physically equivalent, then V~*::5: and V‘2:52 must be unitarily 
physically equivalent, and we could deduce that T3 = — T1, 8,8. In other 
words, the projective representations V®s (r>0) are, for distinct r and 
inequivalent s, physically inequivalent, and any irreducible projective 
representation of the inhomogeneous Galilean group with nonexact 
multiplier is physically equivalent to one of these. 

The action of V% can be described in another and perhaps more usual 
form by passing to Fourier-Plancherel transforms. Let (cf. (209)) 


F :f—>f 


be the Fourier-Plancherel isomorphism of #2(P?,%dp) onto L2(R3, X dx) 
and let 
Pos = FVF -E 


Then 
us — exnl 27 
(254) Pos = exp( 5 v)r, 
where 
02 02 g2 
(255) VOD Bat et ag 
and 


(256) (Df\(x) = exp ir[}<u,v> +<x—u, v)]s(h)f(3(4) ~*(x—u)). 
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V is self-adjoint with the usual domain. The expression (254) shows that 
the energy of the particle is the operator 


l 
(257) E, = =z- V. 
Notice that we have thus proved the formula (77) of Chapter VIII. Further, 
under the (covering group of the) Euclidean group, the states transform 


according to the law described by 


(258) (Puto, A(x) = 9(h)f(8(A)-(x—u)). 


A look at (258) is enough to conclude that the system corresponding to 
V®s is localizable, and the projection valued measure which leads to the 
position operators is the usual one, # —> Pp, where 


(259) Ps: f—> xef. 


In other words, V®s describes the Schrödinger particle of mass r and spin j 
where dim(s)=2j+1. We have thus obtained the following theorem: 


Theorem 9.18. The irreducible representations of the covering group of 
the inhomogeneous Galilean group G cannot represent nontrivial particles 
and the corresponding systems are not localizable. On the other hand, for each 
+>0 and each half-integer j=0, 4, 1,--- there is an irreducible projective 
representation with a nonexact multiplier of the inhomogeneous Galilean 
group, say V=} and (with s= D’) its action is given by (251) and (252). 
Every irreducible projective representation of G with a nonexact multiplier is 
physically equivalent to exactly one of the V*!. The particle corresponding to 
V= is the Schrodinger particle of mass r and spin j, and its energy operator is 
—(1/27)V where V is the Laplacian (255). 


The analysis of the ordinary representations of the inhomogeneous 
Galilean group was first carried out by Inonu and Wigner [1], [2]. For the 
projective representations, the reader may consult the papers of Bargmann 


[1] and Wightman [1] (cf. also Pauli [1)). 


NOTES ON CHAPTER IX 


The mathematical and physical aspects of the theory of relativisitically 
invariant wave equation have spanned a vast literature; see, for instance, 
Invariant Wave Equations, Lecture Notes in Physics, No. 73, Springer- 
Verlag, Berlin, edited by G. Velo and A. S. Wightman, 1978. For an account 
of the early literature on this subject see E.M.Corson, Introduction to 
Tensors, Spinors and Relativistic Wave Equations, Chelsea reprint of the 
1953 edition); see also Chapter IV of M. A. Naimark, Linear Representations 
of the Lorentz Group (in Russian), Moscow, 1958. 
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